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(1) GENERAL INFORMATION: 
(i) APPLICANT: 




SEQUENCE LISTING 



Hilton, Douglc 
Al exander , War 1 
Viney, Elizabeth M. 
Willson, Tracy A. 
Richardson, E&achael T, 
Starr, Rob} 
Nicholson, iSandra E. 
Metcalf, Dpnald 
Nicola, n/cos A. 



*(ii) TITLE OF INVENTION: THI 
(iii) NUMBER OF SEQUENCES: 



^PEUTIC AND DIAGNOSTIC AGENTS 



(iv) CORRESPONDENCE ADDRE< 

(A) ADDRESSEE: Scu/Lly, Scott, Murphy & Presser 

(B) STREET: 4 00 Garden City Plaza 

(C) CITY: Garden/City, New York 

(D) STATE: New ^brk 

(E) COUNTRY: U.jS.A. 

(F) ZIP: 11530, 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER:/ IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US APPLICATION NO. 08/9 62,560 

(B) FILINJ5 DATE: 31-OCT-1997 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/AU97 / 0 072 9 

(B) FIlANG DATE: 31-OCT-1997 



(vi) PRIOR 
(A) 
(B) 



'PLICATION DATA: 
'PLICATION NUMBER: P05117 
DATE: 14-FEB-1997 



(vii) PRI0SR APPLICATION DATA: 

(A)/ APPLICATION NUMBER: PO 3384 
(BY FILING DATE: 01-NOV-1996 



(viii) ATTORNEY /AGENT INFORMATION: 

(k) NAME: DiGiglio, Frank S. 
IB) REGISTRATION NO: 31,346 
;C) REFERENCE /DOCKET NUMBER: 10976 



(ix) TELECOMMUNICATION INFORMATION: 



A 
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(A) TELEPHONE: (516) 742-4343 

(B) TELEFAX: (516) 742-4366 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
' (xi) SEQUENCE DESCRIPTION: SE6 ID NO : 1 



CACGCCGCCC ACGTGAAGGC 



20 



(2) INFORMATION FOR SEQ ID NO :yB : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base& pairs 

(B) TYPE: nucleic Acid 

(C) STRANDEDNESS : /single 
\ (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DN^ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 
TTCGCCAATG ACAAGACGCT 



20 



(2) INFORMATION FOR SEQ ID NO : 3 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH:/ 12 3 6 base pairs 

(B) TYPE: micleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
* / 

(ii) MOLECULE t/tPE: DNA 

(ix) FEATURE: 

(A) NAM^/KEY: CDS 

(B) LOCATION: 1. .636 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 



CGAGGCTCAA GCTQCGGGCG GATTCTGCGT GCCGCTCTCG CTCCTTGGGG TCTGTTGGCC 



-101 
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GGCCTGTGCC ACCCGGACGC CCGGCTCACT GCCTCTGTCT /CCCCCATCAG CGCAGCCCCG -41 
GACGCTATGG CCCACCCCTC CAGCTGGCCC CTCGAGTAGfi -1 



ATG 
Met 
1 

GCA 
Ala 



TCG 
Ser 



GTA 
Val 



GAG 
Glu 



CCA 
Pro 



CCA 
Pro 



TAC 
Tyr 
65 

TAT 
Tyr 



GCC 
Ala 
50 

CGG 
Arg 



^ ( GAG 
Glu 



TTC 
Phe 



TGG 
Trp 



CCC 
Pro 



TTC 
Phe 



GTG 
Val 



TTC 
Phe 
145 

CGC 
Arg 



GAG 
Glu 



CAC 
His 
130 

GAC 
Asp 



ATG 
Met 



CTG 
Leu 



GCA CGC 
Ala Arg 



CCC CGA 
Pro Arg 
20 

GCG GCC 
Ala Ala 
35 

CCT GGC 
Pro Gly 



CGC ATC 
Arg lie 



GGA CCC 
Gly Pro 



GTG GGC 
Val Gly 
100 

GCG CTC 
Ala Leu 
115 

TTC CAG 
Phe Gin 



AAC CAG GTG GCA 
Asn Gin Val Ala 
5 

CGG CGG TCA GAG 
Arg Arg Ser Glu 



CCC GTG CGT CCC 
Pro Val Arg Pro 
40 

GAC ACT CAC TTC 
Asp Thr His Phey 
55 

ACG CGG ACC AG'C 
Thr Arg Thr £j4r 
70 

CTG AGC GTG /CAC 
Leu Ser Va/ His 
85 

ACC TTC T^G GTG 
Thr Phe Leu Val 



GCC GAC 
Ala 

U 

CCC 
Pro 

25. 

CGG? CCC 
'g Pro 



!GC ACC 
Arg Thr 



GCG CTC 
Ala Leu 



^T GCG 
Asn Ala 



TCG TCC 
Ser Ser 



AGC GTC 
Ser Ve 



GCC 
Ala /Gly 



TGC CTT 
Cys Leu 



TTG GGG 
Leu Gly 



GAG 
Glu 
150 

&CC CCG 
fAla Pro 
165 

CAG CGC 
Gin Arg 



AAG ATG 
Lys Met 
120 

CGC TTC 
Arg Phe 
135 

CTG CTG 
Leu Leu 



GGG GCG 
Gly Ala 
90 

CGC GAC 
Arg Asp 
105 

GCT TCG 
Ala Ser 



CAC TTG 
His Leu 



GAG CAC 
Glu His 



CTG CGC 
Leu Arg 



ATC GTG 
lie Val 



CAG CGC 
Gin Arg 
170 

GCC GCC 
Ala Ala 
185 



TGC CCG 
Cys Pro 



TTC CGC 
Phe Arg 
60 

CTG GAC 
Leu Asp 
75 

CAC GAG 
His Glu 



AGT CGT 
Ser Arg 



GGC CCC 
Gly Pro 



GAC GGC 
Asp Gly 
140 

TAC GTG 
Tyr Val 
155 

CGC GTG 
Arg Val 



ATC TCC CCG GCA 
lie Ser Pro Ala 
15 

TCG TCT TCG TCC 
Ser Ser Ser Ser 
30 

GCG GTC CCA GCC 
Ala Val Pro Ala 
45 

TCC CAC TCC GAT 
Ser His Ser Asp 



GCC TGC 
Ala Cys 



CGG CTG 
Arg Leu 



CAA CGG 
Gin Arg 
110 

ACG AGC 
Thr Ser 
125 

AGC CGC 
Ser Arg 



GGC TTC 
Gly Phe 
80 

CGT GCC 
Arg Ala 
95 

AAC TGC 
Asn Cys 



ATC CGC 
lie Arg 



GAG ACC 
Glu Thr 



GTG GGT 
Val Gly 



GCG GCG CCG CGC 
Ala Ala Pro Arg 
160 

CGG CCG CTG CAG 
Arg Pro Leu Gin 
175 

CGC GAG AAC CTG 
Arg Glu Asn Leu 
190 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



1 
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GCG CGC ATC CCT CTT AAC CCG GTA CTC CGT GAC/TAC CTG AGT TCC TTC 624 
Ala Arg lie Pro Leu Asn Pro Val Leu Arg Asp Tyr Leu Ser Ser Phe 
195 200 / 205 

CCC TTC CAG ATC TGA CCGGCTG CCGCTGTGCC (/CAGCATTAA GTGGGGGCGC 67 6 
Pro Phe Gin lie * 
210 

CTTATTATTT CTTATTATTA ATTATTATTA TTTTTCTGGA ACCACGTGGG AGCCCTCCCC 73 6 

GCCTGGGTCG GAGGGAGTGG TTGTGGAGGG T(#AGATGCCT CCCACTTCTG GCTGGAGACC 796 

TCATCCCACC TCTCAGGGGT GGGGGTGCTC jtCCTCCTGGT GCTCCCTCCG GGTCCCCCCT 856 

GGTTGTAGCA GC TTGTGTC T GGGGCCAGG# CCTGAATTCC ACTCCTACCT CTCCATGTTT 916 

AC AT ATTC CC AGTATCTTTG CACAAACC^G GGGTCGGGGA GGGTCTCTGG CTTCATTTTT 976 

CTGCTGTGCA GAATATCCTA TTTTAT^fTT TTACAGCCAG TTTAGGTAAT AAACTTTATT 103 6 

ATGAAAGTTT TTTTTTAAAA GAAAA^kAAA AAAAAAAAA 1075 



(2) INFORMATION FOR SEQ jju NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 212 amino acids 

(B) TYPE: /amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TfY PE : protein 

(xi) sequence Description : seq id no:4: 



Met Val Ala Arg Asn Gin Val Ala Ala Asp Asn Ala lie Ser Pro Ala 

1 /5 10 15 

Ala Glu Pro Arg iLrq Arg Ser Glu Pro Ser Ser Ser Ser Ser Ser Ser 

20 / 25 30 

Ser Pro Ala Alar Pro Val Arg Pro Arg Pro Cys Pro Ala Val Pro Ala 

35 / 40 45 

Pro Ala Pro G/y Asp Thr His Phe Arg Thr Phe Arg Ser His Ser Asp 

50 J 55 60 

Tyr Arg Arg /lie Thr Arg Thr Ser Ala Leu Leu Asp Ala Cys Gly Phe 

65 / 70 75 80 

Tyr Trp Glfo Pro Leu Ser Val His Gly Ala His Glu Arg Leu Arg Ala 

85 90 95 



Glu Pro Val Gly Thr Phe Leu Val 
100 



Arg Asp 
105 



Phe Phe Ala Leu Ser Val 
115 



Val His 
130 

Phe Asp 
145 

Arg Met 
Glu Leu 
Ala Arg 



Pro Phe 
210 



Phe Gin Ala Gly 



Lys Met 
120 

Arg Phe 
135 



Cys Leu Phe Glu Leu Leu 
150 

Leu Gly Ala Pro Leu Arjg 
165 

Cys Arg Gin Arg 
180 

lie Pro Leu Asn 
195 

Gin lie 



Ala Ala 
185 

Leu Arg 



Asp Gly 
140 

Tyr Val 
155 

Arg Val 
Val Gly 
Asp Tyr 



Gin Arg Asn Cys 
110 

Thr Ser lie Arg 
125 

Ser Arg Glu Thr 



Ala Ala Pro Arg 
160 

Arg Pro Leu Gin 
175 

Arg Glu Asn Leu 
190 

Leu Ser Ser Phe 
205 



(2) INFORMATION FOR 9EQ ID NO : 5 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1121 base pairs 

(B) TYPE -J nucleic acid 

(C) STRAj&DEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) N&ME/KEY: CDS 

(B) LOCATION: 223.. 819 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

GCGATCTGTG G&TGACAGTG TCTGCGAGAG ACTTTGCCAC ACCATTCTGC CGGAATTTGG 60 

AGAAAAAGAA 0CAGCCGCTT CCAGTCCCCT CCCCCTCCGC CACCATTTCG GACACCCTGC 12 0 

ACACTCTCGT /TTTGGGGTAC CCTGTGACTT CCAGGCAGCA CGCGAGGTCC ACTGGCCCCA 180 

GCTCGGGCGi/ CCAGCTGTCT GGGACGTGTT GACTCATCTC CC ATG ACC CTG CGG 234 

Met Thr Leu Arg 
1 
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TGC 
Cys 
5 

GGG 
Gly 



GCG 
Ala 



ATG 
Met 



ACT 
Thr 



TCC 
Ser 
85 

GAT 
Asp 



AAA 
Lys 



CTG GAG CCC TCC GGG AAT GGA GCG GAC AGG ACG CGG 
Leu Glu Pro Ser Gly Asn Gly Ala Asp Arg Thr/Arg 

10 15 

ACC GCG GGG TTG CCG GAG GAA CAG TCC CCC C^AG GCG 
Thr Ala Gly Leu Pro Glu Glu Gin Ser Pro/Glu Ala 
25 30 

AAA GCC CTG CGC GAG CTC AGT CAA ACA GGA TGG TAC 
Lys Ala Leu Arg Glu Leu Ser Gin Thr /Gly Trp Tyr 
40 45 

ACT GTT AAT GAA GCC AAA GAG AAA TT^A AAA GAG GCT 
Thr Val Asn Glu Ala Lys Glu Lys l/eu Lys Glu Ala 
55 60 / 65 

TTC TTG ATT AGA GAT AGT TCG CAtf TCA GAC TAC CTA 
Phe Leu lie Arg Asp Ser Ser Hits Ser Asp Tyr Leu 
70 75 / 80 

GTT AAG ACG TCA GCT GGA CCG Act AAC CTG CGG ATT 
Val Lys Thr Ser Ala Gly Prar Thr Asn Leu Arg lie 

90 / 95 

GGG AAA TTC AGA TTG GAT TCT ATC ATA TGT GTC AAG 
Gly Lys Phe Arg Leu Asp /ser lie lie Cys Val Lys 
105 / 110 



AGC CAG TGG 
Ser Gin Trp 
20 

GCG CGT CTG 
Ala Arg Leu 
35 

TGG GGA AGT 
Trp Gly Ser 
50 

CCA GAA GGA 
Pro Glu Gly 



CTA ACT ATA 
Leu Thr lie 



GT5 



CAG TTT GAC AGT GTG 
Gin Phe Asp Ser Val V 
120 



CAT CTG ATT GAC TAC TAT 
His Leu lie Asp Tyr Tyr 
125 



TGC 
Cys 



AAG GAT AAA CGG ACA jGGC CCA GAA GCC CCA CGG AAT 



Lys Asp Lys Arg 
135 



Thr/ Gly Pro Glu Ala Pro Arg Asn 
140 145 



GAG TAC CAA 
Glu Tyr Gin 
100 

TCC AAG CTT 
Ser Lys Leu 
115 

GTC CAG ATG 
Val Gin Met 
. 130 

GGG ACT GTT 
Gly Thr Val 



282 



330 



378 



426 



474 



522 



570 



618 



666 



CCT CTG TAT ACA TCA GCA CCC 
^s Pro Leu Tyr Thr Ser Ala Pro 
155 160 

jCC ATT AAC AAA TGT ACC GGT ACG 
.a lie Asn Lys Cys Thr Gly Thr 
170 175 



CAC CTG TAC CTG ACC 
His Leu Tyr Leu Thr 
150 

CAT TTC TGT CGA CTC 
His Phe Cys Arg Lei 
165 

CTG CCT TTA CCA ACk AGA CTA AAA GAT TAC TTG GAA GAA 
Leu Pro Leu Pro Tfir Arg Leu Lys Asp Tyr Leu Glu Glu 

5 190 

CAG GTA TAAGTATT^C TCTCTCTTTT TCGTTTTTTT TTAAAAAAAA 
Gin Val 



ACT CTG CAG 
Thr Leu Gin 



ATC TGG GGA 
lie Trp Gly 
180 

TAT AAA TTC 
Tyr Lys Phe 
195 

AAAAACACAT 



714 



762 



810 



866 



GCCTCATATA GACTjATCTCC GAATGCAGCT ATGTGAAAGA GAACCCAGAG GCCCTCCTCT 92 6 

GGATAACTGC GCA&AATTCT CTCTTAAGGA CAGTTGGGCT CAGTCTAACT TAAAGGTGTG 98 6 
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AAGATGTAGC TAGGTATTTT AAAGTTCCCC TTAGGTAGTT TTAGCTGAAT GATGCTTTCT 104 6 

TTCCTATGGC TGCTCAAGAT CAAATGGCCC TTTTAAATGA #ACAAAACAA AACAAAACAA 110 6 

AAAAAAAAAA AAAAA / 1121 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 amino ajb ids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: proteii 

(xi) SEQUENCE DESCRIPTION/: SEQ ID NO : 6 



Met 
1 

Arg 
Ala 
Tyr 



Thr Leu Arg Cys Leu Glu Pro Ser Gly Asn Gly 
5 / 10 

Ser Gin Trp Gly Thr Ala/ Gly Leu Pro Glu Glu 
20 / 25 

Ala Arg Leu Ala Lys Mia. Leu Arg Glu Leu Ser 
35 / 40 



Ala Asp Arg Thr 
15 

Gin Ser Pro Glu 
30 

Gin Thr Gly Trp 
45 



Trp Gly Ser Met Thtf Val Asn Glu Ala Lys Glu Lys Leu Lys Glu 
50 / 55 60 



Ala 
65 

Leu 



He 
Lys 
Tyr 



Asn 
145 

Pro 



Thr 



Pro Glu Gly Thr ^he Leu He Arg Asp Ser Ser 

70 75 

Leu Thr He Se?f Val Lys Thr Ser Ala Gly Pro 
85 90 

Glu Tyr Gin £jsp Gly Lys Phe Arg Leu Asp Ser 
100 / 105 



Ser Lys Leu /Lys 
115 



Gin Phe Asp Ser Val Val His 
120 



His Ser Asp Tyr 
80 

Thr Asn Leu Arg 
95 

He lie Cys Val 
110 

Leu He Asp Tyr 
12 5 



Val Gin Meyt Cys Lys Asp Lys Arg Thr Gly Pro Glu Ala Pro Arg 
130 / 135 140 



Gly Thr \/al His Leu Tyr Leu Thr Lys Pro Leu 

150 155 

Thr Leu /Gin His Phe Cys Arg Leu Ala He Asn 
165 170 

He Tr^ Gly Leu Pro Leu Pro Thr Arg Leu Lys 
180 185 



Tyr Thr Ser Ala 
160 

Lys Cys Thr Gly 
175 

Asp Tyr Leu Glu 
190 
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Glu Tyr Lys Phe Gin Val 
195 



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2187 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

' (A) NAME/KEY: CDS 

(B) LOCATION: 18.. 695 



(xi) SEQUENCE DESCRIPTION: SpQ ID NO: 7 

CGCTGGCTCC GTGCGCC ATG GTC ACC £AC AGC AAG TTT CCC GCC GCC GGG 

Met Val Thr/ His Ser Lys Phe Pro Ala Ala Gly 
1/5 10 



50 



ATG AGC CGC CCC CTG GAC ACC AGC CTG CGC CTC AAG ACC TTC AGC TCC 
Met Ser Arg Pro Leu Asp Thr/Ser Leu Arg Leu Lys Thr Phe Ser Ser 
15 / 20 25 



98 



AAA AGC GAG TAC CAG CTG 
Lys Ser Glu Tyr Gin Leu V/ 
30 



GTG AAC GCC GTG CGC AAG CTG CAG GAG 
Val Asn Ala Val Arg Lys Leu Gin Glu 
35 40 



146 



AGC GGA TTC TAC TGG AGC /GCC GTG ACC GGC GGC GAG GCG AAC CTG CTG 



Ser Gly Phe Tyr Trp 
45 



Ala Val Thr Gly Gly Glu Ala Asn Leu Leu 
50 55 



194 



CTC AGC GCC GAG CCC 
Leu Ser Ala Glu Pro 
60 



?G GGC ACC TTT CTT ATC CGC GAC AGC TCG GAC 
La Gly Thr Phe Leu lie Arg Asp Ser Ser Asp 
r 65 70 75 



242 



CAG CGC CAC TTC TTC /ACG TTG AGC GTC AAG ACC CAG TCG GGG ACC AAG 
Gin Arg His Phe PlW Thr Leu Ser Val Lys Thr Gin Ser Gly Thr Lys 

80 85 90 



290 



AAC CTA CGC ATC C; 
Asn Leu Arg lie G/ 
95 



TGT GAG GGG GGC AGC TTT TCG CTG CAG AGT GAC 
Cys Glu Gly Gly Ser Phe Ser Leu Gin Ser Asp 
100 105 



338 



CCC CGA AGC ACG CAG CCA GTT CCC CGC TTC GAC TGT GTA CTC AAG CTG 
Pro Arg Ser Thr /Gin Pro Val Pro Arg Phe Asp Cys Val Leu Lys Leu 
110 / 115 120 



386 



115- 



GTG CAC CAC TAC ATG CCG CCT CCA GGG ACC CCC /TCC TTT TCT TTG CCA 43 4 

Val His His Tyr Met Pro Pro Pro Gly Thr Prey Ser Phe Ser Leu Pro 
125 130 / 135 

CCC ACG GAA CCC TCG TCC GAA GTT CCG GAG GAG CCA CCT GCC CAG GCA 482 
Pro Thr Glu Pro Ser Ser Glu Val Pro Glu /Gin Pro Pro Ala Gin Ala 
140 145 / 150 155 

CTC CCC GGG AGT ACC CCC AAG AGA GCT TAC TAC ATC TAT TCT GGG GGC 530 
Leu Pro Gly Ser Thr Pro Lys Arg Ala Tyr Tyr lie Tyr Ser Gly Gly 

160 /L65 170 

GAG AAG ATT CCG CTG GTA CTG AGC QGN CCT CTC TCC TCC AAC GTG GCC 578 
Glu Lys lie Pro Leu Val Leu Ser Ai/g Pro Leu Ser Ser Asn Val Ala 
175 1/80 185 

ACC CTC CAG CAT CTT TGT CGG AAG Act GTC AAC GGC CAC CTG GAC TCC 62 6 

Thr Leu Gin His Leu Cys Arg Lys' Thr Val Asn Gly His Leu Asp Ser 
190 19B 200 

TAT GAG AAA GTG ACC CAG CTG OCT GGA CCC ATT CGG GAG TTC CTG GAT 674 
Tyr Glu Lys Val Thr Gin Leu /Pro Gly Pro lie Arg Glu Phe Leu Asp 
205 210/ 215 

CAG TAT GAT GCT CCA CTT TAAGGAGCAA AAGGGTCAGA GGGGGGCCTG 722 
Gin Tyr Asp Ala Pro Leu / 
220 225 / 

GGTCGGTCGG TCGCCTCTCC TfCCGAGGCAC ATGGCACAAG CACAAAAATC CAGCCCCAAC 782 

GGTCGGTAGC TCCCAGTGAG yCCAGGGGCAG ATTGGCTTCT TCCTCAGGCC CTCCACTCCC 842 

C\\ GCAGAGTAGA GCTGGCAGG^ CCTGGAATTC GTCTGAGGGG AGGGGGAGCT GCCACCTGCT 9 02 

TTCCCCCCTC CCCCAGCTGC AGCTTCTTTC AAGTGGAGCC AGCCGGCCTG GCCTGGTGGG 9 62 

AC AAT AC CTT TGACAAGCCGG ACTCTCCCCT CCCCTTCCTC CACACCCCCT CTGCTTCCCA 1022 

AGGGAGGTGG GGACACCTCC AAGTGTTGAA CTTAGAACTG CAAGGGGAAT CTTCAAACTT 1082 

TCCCGCTGGA AC TTGTTTGC GCTTTGATTT GGTTTGATCA AGAGCAGGCA CCTGGGGGAA 1142 

GGATGGAAGA GAAA^GGGTG TGTGAAGGGT TTTTATGCTG GCCAAAGAAA TAACCACTCC 12 02 

CACTGCCCAA CCTAGGTGAG GAGTGGTGGC TCCTGGCTCT GGGGAGAGTG GCAAGGGGTG 12 62 

ACCTGAAGAG AGC T AT ACTG GTGCCAGGCT CCTCTCCATG GGGCAGCTAA TGAAACCTCG 1322 

CAGATCCCTT G/CACCCCAGA ACCCTCCCCG TTGTGAAGAG GCAGTAGCAT TTAGAAGGGA 13 82 

GACAGATGAG yGCTGGTGAGC TGGCCGCCTT TTCCAACACC GAAGGGAGGC AGATCAACAG 1442 

ATGAGCCATC7 TTGGAGCCCA GGTTTCCCCT GGAGCAGATG GAGGGTTCTG CTTTGTCTCT 1502 

CCTATGTGG<E GCTAGGAGAC TCGCCTTAAA TGCCCTCTGT CCCAGGGATG GGGATTGGCA 1562 



CACAAGGAGC 
GGCCAGGGGA 
ACTGGGAAGA 
GCCCTGCACA 
CATGCCGCTC 
CTGTGTTGGG 
TGTTGGGGTG 
TTATAAAAAT 
GATGCTTGAA 
TTATACTCAG 
AAAAAAAAAA 



CAAACACAGC 
AGTGGCTGCA 
CATTGGCCAG 
GCCCTCCTTT 
ACAGGGGCCT 
GAGAAACAAG 
GCTTTTTTCT 
CCACCTCCAG 
AAACTCAACC 
AAAAGAAACA 
AAAAAAAAAA 



CAATAGGCAG 
GGGGAGAGAC 
TCCTAGTCAT 
CTCACCTGGG 
CACGGGAATG 
TTTTCTGAAG 
CTCTGTTTTG 
CCCGCCCCTC 
AAATCCCAGT 
TTTCAGTAAT 
AAAAA 



AGAGTTGAGG 
CCAGTCACTC 
CTCTCGGTCA 
GGGAGGCAGG 
CAGCAGCCi 
TCAGGTATGG 
AATAATGTTT 

tccccactca 

:aactcaga 
/ttataataaa 



GATTCAQ 
CAGGAGACTC 
GT^GGTCCGA 
.GGTGATGGA 
GCAATTACCT 
GGCTGGGTGG 
ACAATTTGCC 
GGCCTTCGAG 
CTTTGCACAT 
AGAGCACTAT 



GGTGGCTACA 
CTGAGTTAAC 
GAGCTTCCAG 
GAAGCCTTCC 
GGAACTGGTC 
GGCAGCTGTG 
TCAATCACTT 
GCTGTCTGAA 
ATATTTATAT 
TTTTTAATGA 



(2) INFORMATION FOR SEQ ID N0: 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2^5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGy: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Met Val Thr His Ser ij/s Phe Pro Ala Ala Gly Met 
1 5 j 10 

Asp Thr Ser Leu Arg/Leu Lys Thr Phe Ser Ser Lys 
20 / 25 

Leu Val Val Asn Aya Val Arg Lys Leu Gin Glu Ser 
35 / 40 

Ser Ala Val Thr Gly Gly Glu Ala Asn Leu Leu Leu 
50 / 55 60 

Ala Gly Thr Phef Leu lie Arg Asp Ser Ser Asp Gin 
65 / 70 75 

Thr Leu Ser V$l1 Lys Thr Gin Ser Gly Thr Lys Asn 

85 90 

Cys Glu Gly jfely Ser Phe Ser Leu Gin Ser Asp Pro 
.00 105 



Ser Arg Pro Leu 
15 

Ser Glu Tyr Gin 
30 

Gly Phe Tyr Trp 
45 

Ser Ala Glu Pro 



Arg His Phe Phe 
80 

Leu Arg lie Gin 
95 

Arg Ser Thr Gin 
110 
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Pro Val Pro 
115 

Pro Pro Pro 
130 

Ser Glu Val 
145 

Pro Lys Arg 
Val Leu Ser 



Cys Arg Lys 
195 

Gin Leu Pro 
210 

Leu 
225 



Arg Phe 
Gly Thr 
Pro Glu 



Ala Tyr 
165 

Arg Pro 
180 



Asp Cys Val Leu Lys Leu 
120 

Phe Ser Leu Pro 
Pro Ala 



Pro Ser 
135 



Val His 
125 

Pro Thr 
140 



Gin Pro 
150 



Gin Ala 
155 



Tyr lie Tyr Ser Gly Gly Glj/ Lys 

170 / 

Leu Ser Ser Asn Val Ala /Thr Leu 



Ser Asn 
185 



His Leu 
200 



Thr Val Asn Gly 

Gly Pro lie Arg Glu Phe 
215 



Asp S< 
Leiy Asp 



Tyr Glu 
205 

Gin Tyr 
220 



His /Tyr Met 

Lu Pro Ser 

Gly Ser Thr 
160 

lie Pro Leu 

175 

Gin His Leu 
190 

Lys Val Thr 
Asp Ala Pro 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1094 bafee pairs , 

(B) TYPE: nucleic >acid 
(D) TOPOLOGY: line 



(ii) 


MOLECULE TYPE: protein 








(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 9: 






CTCCGGCTGG 


CCCCTTCTGT 


AGGATGGTAG 


CACACAACCA 


GGTGGCAGCC 


GACAATGCAG 


60 


TCTCCACAGC 


AGCAGAGCCC 


CGACGGCGGC 


CAGAACCTTC 


CTCCTCTTCC 


TCCTCCTCGC 


120 


CCGCGGCCCC 


CGCGCGCCCG 


CGGCCCGTGCC 


CCGCGGTCCC 


GGCCCCGGCC 


CCCGGCGACA 


180 


CGCACTTCCG 


CACATTCCGT 


TCG&CACGCCG 


ATTACCGGCG 


CATCACGCGC 


GCCAGCGCGC 


240 


TCCTGGACGC 


CTGCGGATTC 


TACTGGGGGC 


CCCTGAGCGT 


GCACGGGGCG 


CACGAGCGGC 


300 


TGCGCGCCGA 


GCCCGTGGGC 


AlCCTTCCTGG 


TGCGCGACAG 


CCGCCAGCGG 


AACTGCTTTT 


360 


TCGCCCTTAG 


CGTGAAGATG 


GCCTCGGGAC 


CCACGAGCAT 


CCGCGTGCAC 


TTTCAGGCCG 


420 


GCCGCTTTCA 


CCTGGATGGCi 


(AGCCGCGAGA 


GCTTCGACTG 


CCTCTTCGAG 


CTGCTGGAGC 


480 


ACTACGTGGC 


GGCGCCGCGGf 


CGCATGCTGG 


GGGCCCCGCT 


GCGCCAGCGC 


CGCGTGCGGC 


540 



CGCTGCAGGA 
GCATCCCCCT 
CGGCAGCGCC 
GCCTGGAACC 
GCGAGGCGCC 
GGGGTCCTCC 
GTATCTGGAG 
TATCTTTGCA 
AATCCTATTT 
TTTTTTAAAA 



GCTGTGCCGC 
CAACCCCGTC 
CGCCGTGCAC 
ATGTGGGTAC 
TCCCGCCCTC 
CCCTCCTGGT 
CCAGGACCTG 
CAAACCAGGG 
TATATTTTTT 
AAAA 



CAGCGCATCG 
CTCCGCGACT 
GCAGCATTAA 
CCTCCCCGGC 
GGCTGGAGAC 
GCTCCCTCTG 
AACTCGCACC 
GTTGGGGGAG 
AAAGTCAGTT 
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TGGCCACCGT 
ACCTGAGCTC 
CTGGGATGCC 
CTGGGTTGGA 
GAGGCCGCAG 
GGTCCCCCTG 
TCCTACCTCT, 
GGTCTCTC 
TAGGTAATAA 



GGGCCGCGAG 
CTTCCCCT'] 
GTGTTATTTT 
GGGAGCGGAT 
ACGCCTTCTC 
'TGTTGTAG 
'TCATGTTTAC 
TTTATTTTTC 
ACTTTATTAT 



lCCTGGCTC 600 

CAGATTTGAC 660 

GTTATTACTT 720 

GGGTGTAGGG 780 

ACCTCTTGAG 840 

CAGCTTAACT 900 

ATATACCCAG 960 

TGCTGTGC AG 102 0 

GAAAGTTTTT 1080 
1094 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 211 amino aqids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : s ing] 
Q l (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: proteil 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Val Ala His Asn Glry Val Ala Ala Asp Asn Ala Val Ser Thr Ala 

1 5 / 10 15 

Ala Glu Pro Arg Arg Arfg Pro Glu Pro Ser Ser Ser Ser Ser Ser Ser 
20 / 25 30 

Pro Ala Ala Pro Ala Arg Pro Arg Pro Cys Pro Ala Val Pro Ala Pro 
35 / 40 45 

Ala Pro Gly Asp Thr/ His Phe Arg Thr Phe Arg Ser His Ala Asp Tyr 
50 / 55 60 

Arg Arg lie Thr Arfg Ala Ser Ala Leu Leu Asp Ala Cys Gly Phe Tyr 
65 / 70 75 80 



Trp Gly Pro Leu Ser Val His Gly Ala His Glu Arg Leu Arg Ala Glu 
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Pro Val Gly Thr Phe Leu Val Arg Asp Ser Arg 
100 105 

Phe Ala Leu Ser Val Lys Met Ala Ser Gly Pro 

115 120 

His Phe Gin Ala Gly Arg Phe His Leu Asp GIa 
130 135 



Gin A/g Asn Cys 
110 

Ser lie Arg 
125 

Ser Arg Glu Ser 
140 



Asp Cys Leu Phe Glu Leu Leu Glu His Tyr/val Ala Ala Pro Arg 
145 150 / 155 



Met Leu Gly Ala Pro Leu Arg Gin Arg Arg Val 

165 /L70 

Leu Cys Arg Gin Arg lie Val Ala Thx Val Gly 
180 18T5 

Arg lie Pro Leu Asn Pro Val Leu yArg Asp Tyr 
195 20 

Phe Gin lie 
210 

(2) INFORMATION FOR SEQ ID NO: 11 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 807 Base pairs 

(B) TYPE: nucleic/ acid 
C\{ (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



Arg Pro Leu Gin 
175 

Arg Glu Asn Leu 
190 

Leu Ser Ser Phe 
205 



Phe 

Val 

Phe 

Arg 
160 

Glu 
Ala 
Pro 



GGAAACCGAG 


GCGGGGAGAC 


CAGGfAGGCCT 


TGGCCTCAGA 


GCTTCAGAGT 


CGCGTGGCAG 


60 


CAAACAGAGA 


AACCTGTAGA 


GGGCAGTGTG 


CGTCACTTAG 


CTCAGGGAAG 


CTGCACGCGA 


120 


AACTCACCCG 


CCTTCATTCA 


TMACATCGT 


CAGCTAGGCA 


CCTACTCCTG 


GGCTTTCAGG 


180 


ACAAACTGAA 


TCACGAAACC 


AjCAGTGTCCT 


TAAAATAGGT 


CTGACCGCCT 


GAATCCCTGG 


240 


CCAAGGTGTG 


TACGGGGCAT 


©GGAGCCCTT 


GTGCAGAGAT 


GCTTGCAGGA 


GCCTTGAGGG 


300 


GCTCTGTAAG 


ACAGAGGCTA 


/GGAAGACAAA 


GTTGGGGGCT 


ACAGCTTCTT 


GTCCTGCCCG 


360 


GGGCCTCAGT 


TTCTTCGGTTf 


GCCCACGTAG 


GAGTGCAGAG 


AGTCCAGCCC 


CTGGGGACCC 


420 


AACCCAACCC 


CGCCCAGTTT 


CCGAGGAACT 


CGTCCGGGAG 


CGGGGGCGCC 


CCTCCCGCAC 


480 


CGCCTTAGGC 


TTCCTTTGAfk 


GCCTCTGCGG 


TCAGGCCACC 


GCTTCCTGGG 


AAGCCCAAGC 


540 


CAAGGCCAGG 


CCGAGTGGCEC 


AACGGGAGGG 


GCCCGCGCGC 


GATTCTGGAG 


GAGGGCGGCG 


600 
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GCCCCACAGG TCTCCAGGGC TGGCTAGCCG GGCTCCTAGA GCGGAGAQTG CCAAGGCCTT 660 

CGGGTCCTGG GCAGGAAGGA TCCTGGCAGG GAGGAGTTGC TTGGG^GGTG GGGGGGAAAG 720 

GCTCCAGGCG CGGTGGAGCT CTGACCAGGA GAATGCACAC ACT^GGAGGG GAGGAGGCGT 7 80 

GTCAGCCCCA AGCTAGCATC CCACCCGGGG AGCAGCGATG T^GGGCGAAG GTAGCCAGAG 840 

CAAAAGAGCA GGCACCAGGT GACACGAAAC AGAAGATTCC/GGGTAGAGCC AGAACCCCAG 9 00 

AAGTCCCATT CAGGGAAGGT GCGAGGCGAG AACGAGTT^G GTGGACCCTC TCCAGGGGCA 960 

GCCAAAGAAA TCTAAAGAGA ACCCGAAGGA CTTGCCGGAA AGAGAAACCG AAAGCGGCGG 102 0 

TGGGCGGGAT CGGTGGGCGG GGCCTCCCTG GTTTiyCGAGC TTGATGCAGG GGCGGGCAGC 1080 

AGCAGAGAGA ACTGCGGCCG TGGCAGCGGC ACGGCTCCCG GCCCCGGAGC ATGCGCGACA 1140 

GCAGCCCCGG AACCCCCAGC CGCGGCGCCC CGfCGTCCCGC CGCCAGGTGA GCCGAGGCAG 12 00 

CTGCGAAGGA GCAGGCGGGA GGGGATGGGA GGAAGGGGAG CAGAGCCTGG CAGGACTATC 12 60 

CTCGCAGACT GCATGGCGGG GTCGTGGATg/ctATGCCTCT GGCGCCCGCC CCACCGGCTG 132 0 

GCCCAGGCGG CCCCTCGCGC GCGCGGGGQG CCGTCAGCCC CTCCTCTCCG GCCCTGAGCC 13 80 

CGGATCGTCC GCCCGGGTTC CAGTTCCGGG CGTGGCCAGT AGGCGGCAAC CGCGAGGCGG 1440 

CAAGCCACCC AGCGGGGACG GCCTGGAGTC GGGCCCCTCT CCACGCCCCC TTCTCCACGC 1500 

GCGCGGGGAG GCAGGGCTCC ACCGCdAGTC TGGAAGGGTT CCACATACAG GAACGGCCTA 1560 

CTTCGCAGAT GAGCCCACCG AGGCyCAGGC TCCGGGCGGA TTCTGCGTGT CACCCTCGCT 162 0 

CCTTGGGGTC CGCTGGCCGG CCTQTGCCAC CCGGACGCCC GGTTCACTGC CTCTGTCTCC 1680 

CCCATCAGCG CAGCCCCGGA CGGTATGGCC CACCCCTCCA GCTGGCCCCT CGAGTAGGAT 1740 

GGTAGCACGT AACCAGGTGG AAjGCCGACAA TGCGATCTCC CCGGCATCAG AGCCCCGACG 18 00 

GCGGCCAGAG CCATCCTCGT CCTCGTCTTC GTCCTCGCCG GCGGCCCCGG CGCGTCCCCG 1860 

GCCCTGCCCG GTGGTCCCGG GCCCGGCTCC GGGCGACACT CACTTCCGCA CCTTCCGCTC 192 0 

CCACTCTGAT TACCGGCGCA /tCACGCGGAC CAGCGCTCTC CTGGACGCCT GCGGCTTCTA 19 80 

CTGGGGACCC CTGAGCGTGC/ ATGGGGCGCA CGAACGGCTG CGTTCCGAAC CCGTGGGCAC 2 040 

CTTCTTGGTG CGCGACAGTC GCCAGCGGAA CTGCTTCTTC GCGCTCAGCG TGAAGATGGC 2100 

TTCGGGCCCC ACGAGCATT7C GTGTGCACTT CCAGGCCGGC CGCTTCCACC TGGACGGCAA 2160 

CCGCGAGACC TTCGACTGCC TCTTCGAGCT GCTGGAGCAC TACGTGGCGG CGCCGCGCCG 2 22 0 

CATGTTGGGG GCCCCACTGC GCCAGCGCCG CGTGCGGCCG CTGCAGGAGC TGTGTCGCCA 2 2 80 



GCGCATCGTG 
CCGTGACTAC 
ATTAAGTGGG 
GTGGGAGCCC 
TTCTGGCTGG 
GTCCCCCTGG 
CCATGTTTAC 
TCATTTTTCT 
ACTTTATTAT 



GCCGCCGTGG 
CTGAGTTCCT 
AGCGCCTTAT 
TCCCCGCCTA 
AGACCTTATC 
TTGTAGCAGC 
ATGTTCCCAG 
GCTGTGCAGA 
GAAAGTTTTT 



GTCGCGAGAA 
TCCCCTTCCA 
TATTTCTTAT 
GGTCGGAGGG 
CCGCCTCTCG 
TTGTGTCTGG 
TATCTTTGCA 
ATATTCTATT 
TTTTTTAAAG 
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CCTGGCACGC 
GATCTGACCG 
TATTAATTAT 

agtgggtgt^ 
gggggcqtcc 
ggccaggacc 
caaaccaggg 
t/atattttt 

^CAAAGAT 



ATCCOTCTTA 
GC^GCCGCCG 
'ATTATTTTT 
GAGGGTGAGA 
CCTCCTGGTG 
TGAACTCCAC 
GTGGGGGAGG 
ACATCCAGTT 
TTCTAGA 



ACCCGGTACT 2340 

TGCCCGCAGA 2400 

CTGGAACCAC 2460 

TCCCTCCCAC 252 0 

CTCCCTCCCG 2 580 

GCCTACCTCT 2 640 

GTCTCTGGCT 27 00 

T AGATAATAA 27 60 
2807 



(2) INFORMATION FOR SEQ ID NO:/L2: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 212 am£no acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS/ single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 



Met Val Ala Arg ^fen Gin Val Glu Ala 
1 



Ser Glu Pro Arg/Arg 
20 



Arg Pro Glu Pro 
25 



Ser Pro Ala Ala Pro Ala Arg Pro Arg 
35 / 40 

Pro Ala Pro GT_y Asp Thr His Phe Arg 
50/ 55 

Tyr Arg Arg /lie Thr Arg Thr Ser Ala 
65 / 70 

Tyr Trp Gly Pro Leu Ser Val His Gly 

85 



Asp Asn 
10 

Ser Ser 



Pro Cys 
Thr Phe 



Leu Leu 
75 

Ala His 
90 



Glu Pro Vf 1 Gly Thr Phe Leu Val Arg Asp Ser 
100 105 



Ala lie Ser 

Ser Ser Ser 
30 

Pro Val Val 
45 

Arg Ser His 
60 

Asp Ala Cys 
Glu Arg Leu 



Arg Gin Arg 
110 



Pro Ala 
15 

Ser Ser 



Pro Ala 



Ser Asp 



Gly Phe 
80 

Arg Ser 
95 

Asn Cys 
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Phe Phe Ala Leu Ser Val Lys Met Ala Ser Gly p/o Thr Ser lie Arg 
115 120 / 125 

Val His Phe Gin Ala Gly Arg Phe His Leu Asp Gly Asn Arg Glu Thr 
130 135 / 140 

Phe Asp Cys Leu Phe Glu Leu Leu Glu His' Tyr Val Ala Ala Pro Arg 
145 150 / 155 160 

Arg Met Leu Gly Ala Pro Leu Arg Gln/Arg Arg Val Arg Pro Leu Gin 

165 / 170 175 

Glu Leu Cys Arg Gin Arg lie Val Ala Ala Val Gly Arg Glu Asn Leu 
180 /85 190 

Ala Arg lie Pro Leu Asn Pro Val/ Leu Arg Asp Tyr Leu Ser Ser Phe 
195 20y0 205 

Pro Phe Gin lie 
210 



(2) INFORMATION FOR SEQ ID NO: 

(i) SEQUENCE CHARACTERISTICS ; 

(A) LENGTH: 1611 bafee pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : /single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DN^ 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: /263..1529 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CGAATTCCGG GCGGGCTGTG fl?GAGTCTGTG AGTGGAAGGC GCGCCGGCTC TTTTGTCTGA 6 0 

GTGTGACCCG GTGGCTTTGT / TCCAGGC ATT CCGGTGATTT CCTCCGGGCA GTCCGCAGAA 12 0 

GCCGCAGCGG CCGCCCGCGqf TCTCTCTGCA GTCTCCACAC CCGGGAGAGC CTGAGCCCGC 180 

GTCACGCCCC TCAGCCCCCfe CTGAGTCCCT TCTCTGTTGT CGCGTCCGAA TCGAGTTCCC 240 

GGAATCAGAC GGTGCCCC^T AG ATG GCC AGC TTT CCC CCG AGG GTT AAC GAG 2 92 

Met Ala Ser Phe Pro Pro Arg Val Asn Glu 
15 10 

AAA GAG ATC GTG AG^L TCA CGT ACT ATA GGG GAA CTC TTG GCT CCA GCA 3 40 

Lys Glu lie Val Art Ser Arg Thr lie Gly Glu Leu Leu Ala Pro Ala 

IB 20 25 
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GCT CCT TTT GAC AAG AAA TGT 
Ala Pro Phe Asp Lys Lys Cys 
30 



GGT GGT GAG AAC TGG 
Gly Gly Glu Asn T] 
35 



\CG GTT GCT TTT 
Thr Val Ala Phe 
40 



388 



GCT 
Ala 



CCT GAT 
Pro Asp 
45 



GGT TCC . TAC TTT 
Gly Ser Tyr Phe 



GCG TGG TCA CAA/GGA 
Ala Trp Ser Glyf Gly 
50 



TAT CGC ATA GTG 
Tyr Arg lie Val 
55 



436 



AAG CTT GTC CCG TGG TCC CAG 
Lys Leu Val Pro Trp Ser Gin 
60 65 



TGC CGT AAG 
Cys Arg Lys 



Asn 



TTT 
Phe 
70 



CTT TTG CAT GGT 
Leu Leu His Gly 



484 



TCC AAA AAT GTT ACC AAT TCA 
Ser Lys Asn Val Thr Asn Ser 
75 80 



AGC TGT OTA AAA TTG GCA AGA CAA AAC 
Ser Cys /Leu Lys Leu Ala Arg Gin Asn 
85 90 



532 



AGT AAT GGT GGT CAG AAA AAC AAG COT CCT GAG CAC GTT ATA GAC TGT 
Ser Asn Gly Gly Gin Lys Asn Lys Bro Pro Glu His Val lie Asp Cys 

95 / 100 105 



580 



GGA GAC ATA GTC TGG AGT CTT 
Gly Asp lie Val Trp Ser Leu 
110 



TTT GGG TCT TCA GTT CCA GAA AAA 
Phe Gly Ser Ser Val Pro Glu Lys 
115 120 



628 



CAG 
Gin 



AGT CGT 
Ser Arg 
125 



TGC GTT AAT ATA 
Cys Val Asn ll< 



&AA TGG CAT CGG TTC CGA TTT GGA CAG 
fGlu Trp His Arg Phe Arg Phe Gly Gin 
130 135 



676 



GAT 
Asp 



CAG CTA 
Gin Leu 
140 



CTC 
Leu 



CTT GCC AC 
Leu Ala 



/45 



GGA TTA AAC AAT 
Gly Leu Asn Asn 



GGT 
Gly 
150 



CGC ATC 
Arg lie 



AAA ATC 
Lys lie 



724 



TGG GAT GTA TAT ACA GGA 
Trp Asp Val Tyr Thr Gly/ Lys 
155 16 



CTC CTC CTT AAT TTG GTA GAC CAC ATT 
Leu Leu Leu Asn Leu Val Asp His lie 
165 170 



772 



GAA ATG GTT AGA GAT TTA ACT 
Glu Met Val Arg Asp Lfu Thr 

175 



TTT GCT CCA GAT GGG AGC TTA CTC CTT 
Phe Ala Pro Asp Gly Ser Leu Leu Leu 
180 185 



820 



GTA 
Val 



TCA GCT 
Ser Ala 



TCA 
Ser 
190 



AGA ©AC AAA 
Arg /Asp Lys 



ACT CTA AGA GTG TGG GAC CTG AAA GAT 
Thr Leu Arg Val Trp Asp Leu Lys Asp 
195 200 



868 



GAT 
Asp 



GGA AAC 
Gly Asn 
205 



ATG GTG AAA GTA 
Met Vafl. Lys Val 



TTG CGG GCA CAT CAG AAT TGG GTG TAC 
Leu Arg Ala His Gin Asn Trp Val Tyr 
210 215 



916 



AGT TGT GCA TTC TCT CCC GAC TGT TCT ATG CTG TGT 
Ser Cys Ala Phe aer Pro Asp Cys Ser Met Leu Cys 
220 / 225 230 



TCA GTG GGC GCC 
Ser Val Gly Ala 



964 



AGT AAA GCA GTT (TTC CTT TGG 
Ser Lys Ala Val /Phe Leu Trp 
235 / 240 



AAT ATG GAT AAA TAC 
Asn Met Asp Lys Tyr 
245 



ACC ATG ATT AGG 
Thr Met lie Arg 
250 



1012 
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AAG CTG GAA GGT CAT CAC CAT GAT GTT GTA GCT TGT 
Lys Leu Glu Gly His His His Asp Val Val Ala Cys 

255 260 ' 



GAC JTTT TCT CCT 
Asr/ Phe Ser Pro 
265 



1060 



GAT GGA GCA TTG CTA GCT ACT GCA TCC TAT GAC ACT 
Asp Gly Ala Leu Leu Ala Thr Ala Ser Tyr Asp Thr 
270 275 



GTG TAT GTC 
^rg Val Tyr Val 
280 



1108 



TGG GAT CCA CAC AAT GGA GAC CTT CTG ATG GAG 
Trp Asp Pro His Asn Gly Asp Leu Leu Met Glu 
285 290 



GGG CAC CTG TTT 
Gly His Leu Phe 
295 



1156 



CCC TCG CCC ACT CCA ATA TTT GCT GGA GGA GCA AAT 
Pro Ser Pro Thr Pro lie Phe Ala Gly Gly A/a Asn 
300 305 / 310 



GAC CGA TGG GTG 
Asp Arg Trp Val 



1204 



AGA GCT GTG TCT TTC AGT CAT GAT GGA CTG /CAT GTT GCC AGC CTT GCT 
Arg Ala Val Ser Phe Ser His Asp Gly Leur His Val Ala Ser Leu Ala 
315 320 / 325 330 



1252 



GAT GAT AAA ATG GTG AGG TTC TGG AGA ATC GAT GAG GAT TGT CCG GTA 
Asp Asp Lys Met Val Arg Phe Trp Arg lie Asp Glu Asp Cys Pro Val 

335 /340 345 



1300 



CAA GTT GCA CCT TTG AGC AAT GGT CTy TGC TGT GCC TTT TCT ACT GAT 
Gin Val Ala Pro Leu Ser Asn Gly Leu Cys Cys Ala Phe Ser Thr Asp 
350 3Z5 360 



1348 



GGC AGT GTT TTA GCT GCT GGG ACA OAT GAT GGA AGT GTG TAT TTT TGG 
Gly Ser Val Leu Ala Ala Gly Thr His Asp Gly Ser Val Tyr Phe Trp 
365 370/ 375 



1396 



GCC ACT CCA AGG CAA GTC CCT AGO CTT CAA CAT ATA TGT CGC ATG TCA 
Ala Thr Pro Arg Gin Val Pro Ser Leu Gin His lie Cys Arg Met Ser 
380 385 / 390 



1444 



ATC CGA AGA GTG ATG TCC ACC CAA GAA GTC CAA AAA 
lie Arg Arg Val Met Ser Thr <fln Glu Val Gin Lys 
395 400 / 405 



CTG CCT GTT CCT 
Leu Pro Val Pro 
410 



1492 



TCC AAA ATA TTG GCG TTT CTC /TCC TAC CGC GGT TAG A CTGAAGACTG 
Ser Lys lie Leu Ala Phe Leuf Ser Tyr Arg Gly * 

415 / 420 



1539 



CCTTTCCTGG TAGGCCTGCC AGAC/AGAGCG CCCTTTACAA GACACACCTC AAGCTTTACC 1599 
TCGTGCCGAA TT / 1611 



(2) INFORMATION FOR SEQr ID NO: 14 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 421 amino acids 

(B) TYPEA amino acid 
(D) TOPOLOGY: linear 



Met 
1 



(ii) MOLECULE 

(xi) SEQUENCE 

Ala Ser Phe Pro 
5 
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TYPE : protein 

DESCRIPTION: SEQ ID 

Pro Arg Val Asn Glu 

10 



NO: 14 

Lys C?lu lie 



Val Arg Ser 
15 



Arg 
Cys 
Phe 



Gin 
65 



Thr lie Gly Glu 
20 

Gly Gly Glu Asn 
35 

Ala Trp Ser Gin 
50 

Cys Arg Lys Asn 



Leu Leu Ala Pro Ala 
25 

Trp Thr Val Ala Ph€ 
40 

Gly Tyr Arg lie ^al 

55 

Phe Leu Leu Hi/s Gly 
70 



AYa. Pro Phe 



Ala Pro Asp 
45 

Lys Leu Val 
60 

Ser Lys Asn 
75 



Asp Lys Lys 
30 

Gly Ser Tyr 



Pro Trp Ser 



Val Thr Asn 
80 



Ser Ser Cys Leu Lys 

85 

Asn Lys Pro Pro Glu 
100 



Leu Ala 



/Gin Asn 
90 



Ser Asn Gly 



His Val I/e Asp Cys 
105 



Gly Asp lie 



Gly Gin Lys 
95 

Val Trp Ser 
110 



Leu Ala Phe Gly Ser 
115 



Ser Val/Pro Glu Lys 
120 



Gin Ser Arg 
125 



Cys Val Asn 



lie Glu Trp His Arg 
130 



Phe Atfg Phe Gly Gin 
IB 5 



Asp Gin Leu 
140 



Leu Leu Ala 



Thr 
145 



Gly Leu Asn Asn 



Gly £rg lie Lys lie 
150 



Trp Asp Val 
155 



Tyr Thr Gly 
160 



Lys Leu Leu Leu Asn 

165 

Thr Phe Ala Pro Asp 
180 

Lys Thr Leu Arg Val 
195 

Val Leu Arg Ala His 
210 

Asp Cys Ser Met Leu 
225 

Trp Asn Met Asp Lys 

245 

His Asp Val Val Ala 
260 



Leu/ Val Asp His lie 

170 

Gljf Ser Leu Leu Leu 
185 

Ttfp Asp Leu Lys Asp 
200 

£Ln Asn Trp Val Tyr 

215 

ys Ser Val Gly Ala 
230 

Tyr Thr Met lie Arg 

250 

Cys Asp Phe Ser Pro 
265 



Glu Met Val 



Val Ser Ala 



Asp Gly Asn 
205 

Ser Cys Ala 
220 

Ser Lys Ala 
235 

Lys Leu Glu 



Asp Gly Ala 



Arg Asp Leu 
175 

Ser Arg Asp 
190 

Met Val Lys 



Phe Ser Pro 



Val Phe Leu 
240 

Gly His His 
255 

Leu Leu Ala 
270 
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Thr Ala Ser Tyr Asp Thr Arg Val Tyr Val 
275 280 

Asp Leu Leu Met Glu Phe Gly His Leu Phe 
290 295 

Phe Ala Gly Gly Ala Asn Asp Arg Trp Val 
305 310 

His Asp Gly Leu His Val Ala Ser Leu Ala 

325 330/ 

Phe Trp Arg lie Asp Glu Asp Cys Pro Vc 
340 345 

Asn Gly Leu Cys Cys Ala Phe Ser Thr /Asp 
355 360 

Gly Thr His Asp Gly Ser Val Tyr P^fe Trp 
370 375 

Pro Ser Leu Gin His lie Cys Arg/Met Ser 
385 390 

Thr Gin Glu Val Gin Lys Leu P^o Val Pro 

405 / 410 

Leu Ser Tyr Arg Gly * 
420 

(2) INFORMATION FOR SEQ H^NOilS: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 782 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY:/ linear 

(ii) MOLECULE TYPE/: DNA 



Trp Asp /Pro His Asn Gly 
85 



Pro 



sr Pro Thr Pro lie 



roo 



Ala Val 



.sp Asp Lys 
Gin Val Ala 



Gly Ser Val 
365 

Ala Thr Pro 
380 

lie Arg Arg 
395 

Ser Lys lie 



Ser Phe Ser 
320 

Met Val Arg 
335 

Pro Leu Ser 
350 

Leu Ala Ala 



Arg Gin Val 



Val Met Ser 
400 

Leu Ala Phe 
415 



(xi) SEQUENCE DE 
CTGTCTTCCT CCGCAGCGO 
CTTTGGTCTG AGGCCTTCGfG 
CCCCGCCCGT CTCCTCTGirC 
GTCGCCACTC TCTTCTCTGT 
CATAGATGGC CAGCTTTCCC 
TAGGTGAACT TTTAGCT/CCT 



:RIPTION: SEQ ID NO: 15 
AGGCTGGGTA CAGGGTCTAT 
GAGCTTTCCC GAGGCAGTTA 
CCTGGGCCCG GGAGACAAAC 
TGTTGGGTCC GCATCGTATT 
CCGAGGGTCA AC G AG AAAG A 
GCAGCTCCTT TTGACAAGAA 



TGTCTGTGGT 
GCAGAAGCCG 
TTGGCGTCAC 
CCCGGAATCA 
GATCGTGAGA 
ATGTGGTCGT 



TGACTCCGTA 60 

CAGCGACCGC 12 0 

GCCCTCAGCG 180 

GACGGTGCCC 240 

TCACGTACTA 3 00 

GAAAATTGGA 3 60 
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CTGTTGCTTT TGCTCCAGAT GGTTCATACT TTGCTTGGTC 
AGCTTGTTCC GTGGTCCCAG TGCCTTCAGA ACTTTCTCTT 
CCAATTCAAG CAGTTTAAGA TTGCCAAGAC AAAATAGTGA 
CTCGTGACAT ATTATAGACT GTGGAGATAT AGTCTGGAGT A 
TCCAGAAAAA CAGAGTCGCT GTGTAAATAT AGAATGGCi 
TCAGCTACTT CTTGCTACAG GGTTGAACAA TGGGCGTATC 
GGAAACTCCT CCTTAACTTG GTAGATCATA CTGAAQTGGT 
CAG 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1122 base peyirs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : sing] 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



ACAAGGACAT 
GCATCGCACC 
TC^GTGGTCAG 
OTTGCTTTTG 
CGCTTCAGAT 
AAAATATGGG 
CAGAGATTTA 



CGCACAGTAA 42 0 

AAGAATGTTA 480 

AAAAATAAGC 540 

GGTCATCAGT 600 

TTGGACAAGA 6 60 

ATGTATATCA 72 0 

ACTTTTGCTC 780 

783 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 








CTCTGTATGT 


CTGAATGAAG 


CTATAACATT 


tgccttttta 


TTGCAGGTTT 


TCCTTTGGAA 


60 


TATGGATAAA 


TACACCATGA 


TACGGAAACT 


AGAAGGACAT 


CACCATGATG 


TGGTAGCTTG 


120 


TGACTTTTCT 


C C TGATGG AG 


CATTACTGGC 


TACTGCATCT 


TATGATACTC 


GAGTATATAT 


180 


CTGGGATCCA 


CATAATGGAG 


ACATTCTGAT 


GGAATTTGGG 


CACCTGTTTC 


CCCCACCTAC 


240 


TCCAATATTT 


GCTGGAGGAG 


CAipVTGACCG 


GTGGGTACGA 


TCTGTATCTT 


TTAGCCATGA 


300 


TGGACTGCAT 


GTTGCAAGCC 


TTGCTGATGA 


TAAAATGGTG 


AGGTTCTGGA 


GAATTGATGA 


360 


GGATTATCCA 


GTGCAAGTTG 


CACCTTTGAG 


CAATGGTCTT 


TGCTGTGCCT 


TCTCTACTGA 


420 


TGGCAGTGTT 


TTAGCTGCTG 


GGACACATGA 


CGGAAGTGTG 


TATTTTTGGG 


CCACTCCACG 


480 


GCAGGTCCCT 


AGCCTGCAAC 


ATTTATGTCG 


CATGTCAATC 


CGAAGAGTGA 


TGCCCACCCA 


540 


AGAAGTTCAG 


GAGCTGCCGA 


/ttccttccaa 


GCTTTTGGAG 


TTTCTCTCGT 


ATCGTATTTA 


600 


GAAGATTCTG 


CCTTCCCTAG i 


'tagtagggac 


TGACAGAATA 


CACTTAACAC 


AAACCTCAAG 


660 


CTTTACTGAC 


TTCAATTATC/ 


tgtttttaaa 


GACGTAGAAG 


ATTTATTTAA 


TTTGATATGT 


720 


TCTTGTACTG 


CATTTTGATG 


agttgagctt 


TTAAAATATT 


ATTTATAGAC 


AATAGAAGTA 


780 
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TTTCTGAACA 


TATCAAATAT 


AAATTTTTTT 


AAAGATCTAA 


CTGTGAAAAC 


ATACATACCT 


840 


GTACATATTT 


AGATATAAGC 


TGCTATATGT 


TGAATGGACCy 


/ 2TTTTGCTTT 


TCTGATTTTT 


900 


AGTTCTGACA 


TGTATATATT 


GCTTCAGTAG 


agccacaa/a 


TGTATCTTTG 


CTGTAAAGTG 


960 


CAAGGAAATT 


TTAAATTCTG 


GGACACTGAG 


TTAGATJSGTA 


AATACTGACT 


TACGAAAGTT 


1020 


GAATTGGGTG 


AGGCGGGCAA 


ATCACCTGAG 


GTCAGTCAGTT 


TGAGACTAGC 


CTGGCAAACA 


1080 


TGATGAAACC 


CTGTCTCTAC 


TAAAAATACA 




AA 




1122 



(2) INFORMATION FOR SEQ ID NO: 17 

(i) SEQUENCE CHARACTERISTIC 

(A) LENGTH: 2 544 base/ pairs 

(B) TYPE: nucleic ac/d 

(C) STRANDEDNESS: s^Agle 

(D) TOPOLOGY: linec 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME / KEY : 

(B) LOCATION: 



2030 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CGGCACGAGC CGGGCTCCGT 0CGGAGGAAG CGAGGCTGCG CCGCCGGCCC GGCAGGAGCG 
GAGGACGGGA MGCGCGGGCG GTCGCGCTCG CCCTGTCGCT GACTGCGCTG CCCCGGCCCA 
TCCTTGCCTG GCCGCAGGTG / CCCTGGATGA GGCCGCCGCG CGTGTCCCGG CCGCTGAGTG 
TCCCCCGCGG TCGCCCGGCGj/ CCTGCCCTCA AGCGGCCGCC TCTCCTTGCC CGGGTCCCCG 
TTTTCCCCCG GCGCAGTCCT CCTCCGGTGG GCGCCTCCGC ACCTCGGCGC AGGCGGCACG 
GCCCTCGGGC CGGGATGGAET CCGCCGGGAA GAGGAAGACA AGCCGGGGCG TTGAGCCCCT 



GCGCACGGTG CCGCCGCG 

CA ATG GAT AAA GTG 
Met Asp Lys Val 
1 



G TAGTGGGAGC TTACTCGCAG TAGGCTCTCG CTCTTCTAAT 

CtGG AAA ATG TGG AAC AAC TTA AAA TAC AGA TGC 
31y Lys Met Trp Asn Asn Leu Lys Tyr Arg Cys 
5 10 15 



CAG AAT CTC TTC AGC 
Gin Asn Leu Phe Ser 

2C 



CAC GAG GGA GGA AGC CGT AAT GAG AAC GTG GAG 
His Glu Gly Gly Ser Arg Asn Glu Asn Val Glu 

25 30 



60 
120 
180 
240 
300 
360 
420 
467 

515 
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ATG AAC CCC AAC AGA TGT CCG TCT GTC AAA GAG AAA AGC ATC AGT CTG 
Met Asn Pro Asn Arg Cys Pro Ser Val Lys Glu Lys Ser lie 2er Leu 
35 40 45, 



563 



GGA GAG 
Gly Glu 



GCA GCT 
Ala Ala 
50 



CCC CAG 
Pro Gin 



CAA GAG 
Gin Glu 
55 



AGC AGT 
Ser Ser 



CCC TTA AGA 
Pro Leu Arg 
60/ 



Jlu 



AAT GTT 
Asn Val 



611 



GCC TTA CAG CTG GGA CTG AGC CCT TCC AAG ACC TTT WC AGG CGG AAC 
Ala Leu Gin Leu Gly Leu Ser Pro Ser Lys Thr Phe per Arg Arg Asn 
65 70 75/ 



659 



CAA AAC TGT GCC GCA GAG ATC CCT CAA GTG GTT Gj(A ATC AGC ATC GAG 
Gin Asn Cys Ala Ala Glu lie Pro Gin Val Val JGlu lie Ser lie Glu 
80 85 90/ 95 



707 



AAA GAC 
Lys Asp 



AGT GAC 
Ser Asp 



TCG GGT 
Ser Gly 
100 



GCC ACC 
Ala Thr 



CCA GGA A{?G 
Pro Gly 
105 



AGG CTT GCA 
'hr Arg Leu Ala 



CGG AGA 
Arg Arg 
110 



755 



GAC TCC TAC TCG CGG CAC GCC CCG TGG GQfA GGA AAG AAG AAA CAT TCC 
Asp Ser Tyr Ser Arg His Ala Pro Trp (sly Gly Lys Lys Lys His Ser 
115 120/ 125 



803 



TGT TCC ACA AAG ACC CAG AGT TCA TTG GAT ACC GAG AAA AAG TTT GGT 
Cys Ser Thr Lys Thr Gin Ser Ser I^eu Asp Thr Glu Lys Lys Phe Gly 
130 135/ 140 



851 



AGA ACT CGA AGC GGC CTT CAG AG/6 CGA GAG CGG CGC TAT GGA GTC AGC 
Arg Thr Arg Ser Gly Leu Gin Arg Arg Glu Arg Arg Tyr Gly Val Ser 
145 150 / 155 



899 



TCC ATG CAG GAC ATG GAC AGO' GTT TCT AGC CGC GCG GTC GGG AGC CGC 
Ser Met Gin Asp Met Asp Sfr Val Ser Ser Arg Ala Val Gly Ser Arg 
160 165 / 170 175 



947 



TCC 
Ser 



CTG 
Leu 



AGG CAG 
Arg Gin 



AGG CTC 
Arg Lei 
180 



/CAG GAC 
Gin Asp 



ACG GTG 
Thr Val 
185 



GGT TTG TGT TTT 
Gly Leu Cys Phe 



CCC ATG 
Pro Met 
190 



995 



AGA ACT 
Arg Thr 



TAC AGC 
Tyr Ser 
195 



AAG 
Lys 



TCA AAG 
Ser Lys 



CCA CTC 
Pro Leu 
200 



TTT TCC AAT AAA 
Phe Ser Asn Lys 
205 



AGA AAA 
Arg Lys 



1043 



ATA CAT CTT TCT GAAJ TTA ATG CTG GAG AAA TGC CCT TTT CCT GCT GGC 
lie His Leu Ser Glm Leu Met Leu Glu Lys Cys Pro Phe Pro Ala Gly 
210 / 215 220 



1091 



TCG GAT TTA GCA Ci^A AAG TGG CAT TTG ATT AAA CAG CAT ACC GCC CCT 
Ser Asp Leu Ala G£Ln Lys Trp His Leu lie Lys Gin His Thr Ala Pro 
225 / 230 235 



1139 



GTG AGC 
Val Ser 
240 



CCA CAC 
Pro His 



fCA ACA 
Ser Thr 
245 



TTT TTT 
Phe Phe 



GAT ACA 
Asp Thr 



TTT GAT CCA TCA 
Phe Asp Pro Ser 
250 



CTG GTG 
Leu Val 
255 



1187 



TCT 
Ser 



ATC 
He 



GAA 
Glu 



GCT 
Ala 



GCS 
Xaa 
320 

CGG 
Arg 



AGC 
Ser 



TGC 
Cys 



GGC 
Gly 



GAA 
Glu 
400 

TCT 
Ser 



CAG 
Gin 



CAC 
His 



TCT 
Ser 



ACA GAA 
Thr Glu 



GAA GAA 
Glu Glu 



GCT ACT 
Ala Thr 
290 

CCT GGG 
,Pro Gly 
305 

AAT TGT 

Asn Cys 



AGG CAG 
Arg Gin 



AGA CAG 
Arg Gin 



GAT GAA GAA 
Asp Glu Glu 
260 

GGG GTG GAT 
Gly Val Asp 
275 

GCA CAG GTC 
Ala Gin Val 



ATG ACA GAG 
Met Thr Glu 



GAC TCA GAA 
Asp Ser Glu 
325 

AAG CAG CGC 
Lys Gin Arg 
340 



GAT AGG 
Asp Arg 



CCC CCT 
Pro Pro 



AAC CCA 
Asn Pro 
295 

ATA AGT 
He Ser 
310 

GAG GAT 
Glu Asp 



CAG Gj?G 
Gin 77al 



- 130- 

CTT CGC GAG 
Leu Arg Glu 
265 

CCC AAC GCA 
Pro. Asn Ala 
280 

TTG TAT 
Leu Tyr /Lys 



GGA GAT GGT 
Gly Asp Gly 



:a ACC ACC 
Ser Thr Thr 
330 

TCC GGG GAC 
Ser Gly Asp 
345 



AGA 
Arg 



TCT 
Ser 
315 

CTA 
Leu 



AGC 
Ser 



AGA /CGG CTT AGT 
AiQg Arg Leu Ser 
270 

ATA CAC ACC TTT 
He His Thr Phe 
285 

GGA CCA AAG TTA 
Gly Pro Lys Leu 
300 

GCA ATT CCA CAA 
Ala He Pro Gin 



TGT CTG 
Cys Leu 



CAC GCG 
His Ala 



CAG TCA 
Gin Ser 
335 

CAC GTT 
His Val 
350 



CTC GTG 
Leu Val 
370 

GTG ATG 
Val Met 
385 

GGC ACG 
Gly Thr 



GTG AGC 
Val Ser 



TGG AAC 
Trp Asn 



TCC TCC 
Ser Ser 
450 

TGC ATG 
Cys Met 
465 




GGA GCT TGG 
Gly Ala Trp 
355 



CCA GAT TT 
Pro Asp L 



GAC CGA /TAC 
Asp Arg/ Tyr 



GAG GCC 
Glu Ala 
390 

AGG GAC 
Arg Asp 



TTC /CGC CGC 
Phe/ Arg Arg 
420 

CAt AAC TTC 
Hiis Asn Phe 
435 

CW GTC ACG 
Val Thr 



'TT TTT GAA 
>he Phe Glu 



TAC AAC 
Tyr Asn 



AGC TTC 
Ser Phe 



GGG CTT 
Gly Leu 
455 

CCG TTG 
Pro Leu 
470 



CAT ACG CAG 
His Thr Gin 
360 

ATC ACA GGG 
He Thr Gly 



GAA GCC CTT 
Glu Ala Leu 



TCT GCA CAG 
Ser Ala Gin 
410 

AGG TCT CTG 
Arg Ser Leu 
425 

GAT GCC CAT 
Asp Ala His 
440 

CTC GAA CAC 
Leu Glu His 



CTA ACG ATA 
Leu Thr He 



ATC 
He 



AAT 
Asn 



CTA 
Leu 
395 

GAG 
Glu 



CAC 
His 



GAC 
Asp 



TAT 
Tyr 



TCA 
Ser 
475 



GAT TAC 
Asp Tyr 
365 

CCC TGT 
Pro Cys 
380 

GAA GGG 
Glu Gly 



ATA CAC 
He His 



TAC TGG 
Tyr Trp 



AAA CCG 
Lys Pro 



GAC TAC CTC TTC 
Asp Tyr Leu Phe 
415 

GCC CGG ATC GAG 
Ala Arg He Glu 
430 

CCC TGC GTG TTT 
Pro Cys Val Phe 
445 

AAA GAC CCC AGC 
Lys Asp Pro Ser 
460 

CTG AAT AGA ACT 
Leu Asn Arg Thr 



1235 



1283 



1331 



1379 



1427 



1475 



1523 



1571 



1619 



1667 



1715 



1763 



1811 



1859 



i 



1 



131 



TTC CCT TTC AGC CTG CAG TAT ATC TGC CGC GCA GTG ATC TGC AGA TGC 
Phe Pro Phe Ser Leu Gin Tyr lie Cys Arg Ala Val lie' Cys Arg Cys 
480 485 490 / 495 



ACT ACG TAT GAT GGG ATT GAC GGG CTC CCG CTA 
Thr Thr Tyr Asp Gly lie Asp Gly Leu Pro Leu 

500 505 



CC# TCG ATG TTA CAG 
o Ser Met Leu Gin 
510 



GAT TTT TTA AAA GAG TAT CAT TAT AAA CAA 
Asp Phe Leu Lys Glu Tyr His Tyr Lys 
515 520 



GTT AGG GTT CGC TGG 
Gln/Lys Val Arg Val Arg Trp 

525 



TTA GAA CGA GAR CCA GTC AAA GCA AAG yTAACTCCTGT CCCCAAAGGG 
Leu Glu Arg Xaa Pro Val Lys Ala Ly^ 
530 535 



1907 



1955 



2003 



2050 



CACTAACTAA 


GTCTGCTCCT 


C C C GTGC ATGr 


MGAACTGCAC 


C C AT AGGRAG 


GCAGTCAGCT 


2110 


GCTAGGATTT 


CCCACCCAGA 


ATGGGAGQTT 


AGTCATTAGC 


CTCTGCCCTA 


TGGGGTCCGC 


2170 


TGTTCCTCAG 


ACAAAGGTGC 


CTAGGGACAG 


CAAGATGGCT 


TGCAGGTGTT 


CGGTGGGCTG 


2230 


TGACAACTGA 


GGGAGGCAAC 


TCTOGGGCAT 


TTGCTATGAA 


GAATTCTATT 


TCTTACCGAA 


2290 


GAACAAATTA 


TTAATATTGG 


ATGGGTATTT 


CAATAGTGTG 


ACTAATGTTT 


GAAATTATTT 


2350 


TTTCTAAGAA 


TTTTTCTATA 


ACCTTCAGAA 


AAAGTAGTGA 


TGTTTGTAGT 


TACTATAAAT 


2410 


CAAGCTTTGA 


AAGTTCAAZy^ 


CAAACAAGTT 


AAATAAAAGA 


CTACCTTCCT 


TTTAGAGAAA 


2470 


ACAAATGCAA 


GTTTTCCOAG 


CCACAGGCAT 


TGTGCACTGT 


TAATGTTAGC 


TTGTTATCAG 


2530 


CTCCTTTCTC 


CTCC / 










2544 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

/(A) LENGTH: 53 6 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) /MOLECULE TYPE: protein 

(xi/ SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Asp ays Val Gly Lys Met Trp Asn Asn Leu Lys Tyr Arg Cys Gin 
1/5 10 15 

Asn Leu/ Phe Ser His Glu Gly Gly Ser Arg Asn Glu Asn Val Glu Met 
20 25 30 

Asn Pr£ Asn Arg Cys Pro Ser Val Lys Glu Lys Ser lie Ser Leu Gly 
35 40 45 



/ 
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Glu Ala 
50 

Leu Gin 
65 

Asn Cys 
Asp Ser 
Ser Tyr 



Ala Pro Gin Gin 
Leu Gly 
Ala Ala 



Leu Ser 
70 



Asp Ser 
100 

Ser Arg 
115 



Ser Thr 
130 



Lys Thr 
Ser Gly 
Met Gin Asp Met 



Thr Arg 
145 



Leu Arg 
Thr Tyr 



His Leu 
210 

Asp Leu 
225 



Gin Arg 
180 

Ser Lys 
195 



Glu lie 
85 

Gly Ala 

His Ala 

Gin Ser 

Leu Gin 
150 

Asp Ser 
165 

Leu Gin 
Gin Ser 



Ser Glu Leu Met 
Ala Gin 



Ser Pro His Ser 



Thr Glu 
Glu Glu 



Ala Thr 
290 

Pro Gly 
305 



Asp Glu 
260 

Gly Val 
275 

Ala Gin 
Met Thr 



Lys Trp 
230 

Thr Phe 
245 

Glu Asi 



Asn Cys Asp Ser 



GJu lie 
310 

Slu Glu 

125 



Glu Ser 
55 

Pro Ser 



Pro Gin 

Thr Pro 

Pro Trp 
120 

Ser Leu 
135 

Arg Arg 
Val Ser 
Asp Thr 



Lys Pre 
20/ 

Leu QjIu 
215 



> Asp 

Arg Leu 

Pro Pro 
280 

Pro Leu 
295 

Ser Gly 
Asp Ser 



Ser Pro 

Lys Thr 

Val Val 
90 

Gly Thr 
105 

Gly Gly 
Asp Thr 
Glu Ari 



Ser /Arg 
170 

Ll Gly 
15 



Leu Arg 
60 

Phe Ser 
75 



Glu Asn Val Ala 



Arg Apg Asn Gin 
80 



Glu lie Se/£ lie Glu Lys 

95 



Arg Lei 
Lys /Lys 



Lys 
140 

Arg Tyr 
155 

Ala Val 



Leu Cys 



Leu Phe Ser Asn 
Lys Cys 
lie Lys 



Pro Phe 
220 



Gin His 
235 



Thr Phe 
250 



Asp Pro 
Arg Arg 
Asn Ala Gin lie 



Arg Glu 
265 



Tyr Lys 
Asp Gly 



Thr Thr 
330 



Leu Gly 
300 

Ser Ala 
315 

Leu Cys 



Ala Arg Arg Asp 
110 

Lys His Ser Cys 
125 

Lys Phe Gly Arg 



Gly Val Ser Ser 
160 

Gly Ser Arg Ser 
175 

Phe Pro Met Arg 
190 

Lys Arg Lys lie 
205 

Pro Ala Gly Ser 



Thr Ala Pro Val 
240 

Ser Leu Val Ser 
255 

Arg Leu Ser lie 
270 

His Thr Phe Glu 
285 

Pro Lys Leu Ala 



lie Pro Gin Xaa 
320 

Leu Gin Ser Arg 
335 



Arg Gin Lys Gin Arg Gin Val 
340 

Arg Gin Gly Ala Trp Lys Val 
355 

Leu Val Pro Asp Leu Leu Gin 
370 375 



Ser Gly Asp Ser 
345 

His Thr Gin lie 
360 



His Ala yis 
>0 

Asp T#r lie 
5 



lie Thr Gly Asn P; 



f o Cys Tyr 
0 



Val Met Asp Arg Tyr Glu 
385 390 

Gly Thr Phe Leu Leu Arg 

405 

Val Ser Phe Arg Arg Tyr 
420 

Trp Asn His Asn Phe Ser 
435 

Ser Ser Xaa Val Thr Gly 
450 

Cys Met Phe Phe Glu Pro 
465 470 

Pro Phe Ser Leu Gin Tyr 

485 



Thr Tyr Asp Gly lie Asp/Gly 
500 

Phe Leu Lys Glu Tyr H/l s Tyr 
515 

Glu Arg Xaa Pro Val/ Lys Ala 
530 / 535 



Ala Glu Ala Leu Le; 
Asp 



Asn 



Phe 



Leu 
455 

Leu 



Ser Ala Gln/Glu 
4 

Arg Ser J£eu His 
425 

Asp A/a His Asp 
440 

Lei/ Glu His Tyr 



jeu Thr lie Ser 
475 

Cys Arg Ala Val 
490 

Leu Pro Leu Pro 
505 

Lys Gin Lys Val 
520 

Lys 



Glu Gly Lys 

Asp Tyr Leu 

Ala Arg lie 
430 

Pro Cys Val 
445 

Lys Asp Pro 
460 

Leu Asn Arg 
lie Cys Arg 



Ser Met Leu 
510 

Arg Val Arg 
525 



Val Ser 

His Cys 

Trp Gly 

Pro Glu 
400 

Phe Ser 
415 

Glu Gin 

Phe His 

Ser Ser 

Thr Phe 
480 

Cys Thr 
495 

Gin Asp 
Trp Leu 



(2) INFORMATION BOR SEQ ID NO: 19: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1221 base pairs 

(B) yYPE: nucleic acid 

( C ) STRANDEDNE S S : s i ng 1 e 

(D) /TOPOLOGY : linear 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GATTAAACAG (EATACAGCTC CTGTGAGCCC ACATTCAACA TTTTTTGATA CTTTGATCCA 



134 



TCTTTGGTTT 


CTACAGAAGA 


TGAAGAAGAT 


AGGCTTAGAG 


AGAGAAGGCG 


GGTTAGTATT 


120 


GAAGAAGGGG 


TTGATCCCCC 


TCCCAATGCA 


CAAATACATA 


CATTTGAAGG/ 


TACTGCACAG 


180 


GTTAATCCAT 


TATTAAACTG 


GGACCAAAAT 


TAGCTCCTGG 


AATGACT^AA 


ATAAGTGGGG 


240 


ACAGTTCTGC 


AATTCCACAA 


GCTAATTGTG 


ACTCGGAAGA 


GGATACAACC 


ACCCTGTGTT 


300 


GCAGTCACGG 


AGGCAGAAGC 


AGCGTCAGAT 


ATCTGGAGAC 


AG^CATACCC 


ATGTTAGCAG 


360 


ACAGGGAGCT 


TGGAAAGTCC 


ACACACAGAT 


TGATTACATA 


/actgcttcg 


TGCCTGATTT 


420 


GCTTCAAATT 


ACAGGGAATC 


CCTGTTACTG 


GGGAGTGATG 


GACCGTTATG 


AAGCAGAAGC 


480 


CCTTCTCGAA 


GGGAAACCTG 


AAGGCACGTT 


TTTGCTGAGG 


GACTCTGCGC 


AAGAGGACTA 


540 


CTTCTTCTCT 


GTGAGCTTCC 


GCCGATACAA 


CAGATJCCCTG 


CATGCCCGAA 


TTGAGCAGTG 


600 


GAATCACAAC 


TTTAGTTTCG 


ACGCCCATGA 


CCCffTGTGTA 


TTTCACTCCT 


CCACTGTAAC 


660 


GGGACTTTTA 


GAACATTATA 


AAGATCCCAG 


TTCGTGCATG 


TTTTTTGAAC 


CATTGCTTAC 


720 


TATATCACTA 


AATAGGACTT 


TCCCTTTTAG 


/CCTGCAGTAT 


ATCTGTCGCG 


CGGTAATCTG 


780 


CAGGTGCACT 


ACGTATGATG 


GAATTGATGG 


GCTCCCTCTA 


CCCTCAATGT 


TACAGGATTT 


840 


TTTAAAAGAG 


TATCATTATA 


AACAAAA^GT 


TAGAGTTCGC 


TGGTTGGAAC 


GAGAACCAGT 


900 


CAAGGCAAAG 


TAAACTCTCC 


GGTCCQCAAA 


GGGTGTTAAC 


TAGGTCCGCT 


TTCATGTGCA 


960 


TCAGACAGTA 


CACCTATAGC 


AAGCACACGT 


AGCAGTGTTA 


GGCTTTTTCA 


TACAGTATGT 


1020 


AAGCTTAGTG 


TTAGTATCTG 


TC^GATGCTA 


CCTGCTGTTA 


CTTATTCAGA 


TAAACATGGT 


1080 


GCCTATTGGA 


ACAATAGCGG 


ATAGAGCTAC 


AGGTGTTCAG 


TAAGACTACA 


AAAACATTTT 


1140 


GCCTATTTCG 


CTAACAGTTT 


GGTTTTTAAT 


GGCTGTGGTA 


TTTGAGTGAG 


GCAACTGTGG 


1200 


GGCATTTGTT 


atgaagaaat/ 


G 








1221 



(2) INFORMATION FOR SEQ ID NO: 20: 

( i ) SEQUENCE /CHARACTERISTICS : 

(A) LENGTH: 23 69 base pairs 

(B) TYHE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY: CDS 



- 135- 



(B) LOCATION: 116.. 1330 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

GGCACGAGGC GGTGGTGGCG GCGGCGGGCG CGGCCGCGGC GGGGCGGG^G CGGAATGAAG 60 

GCCCACGGCC CTGGGGGCTG AGGCGCCCGC CGCCTGGGGC GGGCCGCGCG TCCTC ATG 118 

Met 
1 

GAG GCC GGA GAG GAG CCG CTG CTG CTG GCT GAA yCTC AAG CCT GGG CGC 166 
Glu Ala Gly Glu Glu Pro Leu Leu Leu Ala Glj/ Leu Lys Pro Gly Arg 
5 10/15 

CCC CAC CAG TTC GAC TGG AAG TCA AGC TGC/GAG ACC TGG AGC GTG GCC 214 
Pro His Gin Phe Asp Trp Lys Ser Ser Cy£ Glu Thr Trp Ser Val Ala 
20 25 / 30 

TTC TCG CCA GAC GGT TCC TGG TTC GCC/TGG TCT CAA GGA CAC TGC GTG 2 62 

Phe Ser Pro Asp Gly Ser Trp Phe ALa Trp Ser Gin Gly His Cys Val 
35 40 / 45 

GTC AAG CTG GTC CCC TGG CCC TT A/GAG GAA CAG TTC ATC CCT AAA GGA 310 
Val Lys Leu Val Pro Trp Pro LeU Glu Glu Gin Phe lie Pro Lys Gly 
50 55 / 60 65 

TTC GAA GCC AAG AGC CGA AGC /AGC AAG AAT GAC CCA AAA GGA CGG GGC 3 58 

Phe Glu Ala Lys Ser Arg Set Ser Lys Asn Asp Pro Lys Gly Arg Gly 

70 / 75 80 

AGT CTG AAG GAG AAG ACG fTG GAC TGT GGC CAG ATT GTG TGG GGG CTG 406 
Ser Leu Lys Glu Lys Thr/Leu Asp Cys Gly Gin lie Val Trp Gly Leu 
85 / 90 95 

GCC TTC AGC CCG TGG CfC TCT CCA CCC AGC AGG AAA CTC TGG GCA CGT 454 
Ala Phe Ser Pro Trp Pro Ser Pro Pro Ser Arg Lys Leu Trp Ala Arg 
100 / 105 110 

CAC CAT CCC CAG GCG/ CCT GAT GTT TCT TGC CTG ATC CTG GCC ACA GGT 502 
His His Pro Gin Alp. Pro Asp Val Ser Cys Leu lie Leu Ala Thr Gly 
115 7 120 125 

CTC AAC GAT GGG dAG ATC AAG ATT TGG GAG GTA CAG ACA GGC CTC CTG 550 
Leu Asn Asp Gly /Gin lie Lys lie Trp Glu Val Gin Thr Gly Leu Leu 
130 / 135 140 145 

CTT CTG AAT CTT/ TCT GGC CAC CAA GAC GTC GTG AGA GAT CTG AGC TTC 598 
Leu Leu Asn Lei Ser Gly His Gin Asp Val Val Arg Asp Leu Ser Phe 
f 150 155 160 

ACG CCC AGC GGC AGT TTG ATT TTG GTC TCT GCA TCC CGG GAT AAG ACA 646 
Thr Pro Ser GiLy Ser Leu lie Leu Val Ser Ala Ser Arg Asp Lys Thr 
1/65 170 175 



- 136- 



CTT CGA ATT 
Leu Arg lie 
180 

TCC GGC CAT 
Ser Gly His 
195 

AGC ATG CTG 
Ser Met Leu 
210 

ATG CGG TCC 
Met Arg Ser 



GTT GTC TCC 
Val Val Ser 



TGG GAC CTG 
Trp Asp Leu 



CTG CAG TGG 
Leu Gin Trp 



TCG 
Ser 



CTG 
Leu 



GAC 
Asp 
290 

TTG 
Leu 



TAT 
Tyr 



AGG 
Arg 
275 

GTC 
Val 



GAC 
Asp 
260 

TCA 
Ser 



CAC 
His 



TGC TCT GCA 
Cys Ser Ala 
215 

TAC ACA CTA 
Tyr Thr Leu 
230 

TGT GAT TTC 
Cys Asp Phe 
245 

ACC AGT GTG 
Thr Ser Val 



CTT CAT CAC 
Leu His His 



AAT AAA CAC 
Asn Lys His 
185 

GTT TAC TGC 
Val Tyr Cys 
200 

GCT GGG GAG 
Ala Gly Glu 



ATC CGG AAA 
lie Arg Lys 



GGT 
Gly 



TGC 
Cys 



AAG 
Lys 



CTA 
Leu 
235 



AAG CAG ATC CAG 
Lys Gin lie Gin 
190 



TCC ATC TCC 
Ser lie Ser y 
205 



Pro 



GTG/TTA 
Val Leu 



GAC TGT 
Asp Cys 



TCG GTC/TTT CTG 
Ser Vai Phe Leu 
220 

GA^ GGC CAC CAA 
Lu Gly His Gin 



TCT CCT GAT 
Ser Pro Asp 
250 

ATT ATG TG)S 
lie Met 'prp 
265 



GAC 
Asp 



GCC TTG 
Ala Leu 



CCC TAC 
Pro Tyr 



ACA CI 
Thr 
280 



CTT 
Leu 



TAT CTC 
Tyr Leu 



CTG GAA CTG 
Leu Glu Leu 



TGC 
Cys 



TGC 
Cys 



GAT 
Asp 



AAG 
Lys 
370 

GTC 
Val 



ATG AGC TCC 
Met Ser Ser 
295 

GCT ACG GTG> 
Ala Thr V 
310 

AAG GCT tCG 
Lys Ala/ Pro 
325 

TTC Tjf'C CCA 
Phe Phe Pro 



GTGT CAG TTC 
Vc£L Gin Phe 



CT</ AGG TCC 
Arg Ser 



GAA 
Glu 



GTG 
Val 



GCA GAT GAC 
Ala Asp Asp 



CAC 
His 



TTA 
Leu 



CTA GCi 
Leu Al< 



i/GC AGG AAA 
:ys Arg Lys 
375 

CTG CCA ATC 
Leu Pro lie 
390 



GTT GCC TTT 
Val Ala Phe 
330 

CAC GGT GGA 
His Gly Gly 
345 

TGG ACA GCT 
Trp Thr Ala 
360 

GCC CTC CGA 
Ala Leu Arg 



AGG 
Arg 
315 

GCT 
Ala 



CCC ACC 
Pro Thr 
285 

TGC TTC 
Cys Phe 
300 

CTG CTC 
Leu Leu 



CTT GTC 
Leu Val 
255 

ACC GGC 
Thr Gly 
270 

ATG GAT 
Met Asp 



TGG AGC 
Trp Ser 
225 

AGC AGT 
Ser Ser 
240 

ACA GCT 
Thr Ala 



TCA CCT 
Ser Pro 



AGG ATC 
Arg lie 



GCG AGG 
Ala Arg 



GAC AGT 
Asp Ser 



GAA GGC 
Glu Gly 
305 

TGG GCT 
Trp Ala 
320 



CCC AAG AAG 
Pro Lys Lys 



ATT 
He 



CCC 
Pro 



AGT 
Ser 



ATG 
Met 
395 



CCG ATG 
Pro Met 



ATT GCC 
He Ala 



CGG GTC 
Arg Val 
365 

TTC CTG 
Phe Leu 
380 

AAA GAG 
Lys Glu 



ACC AAT 
Thr Asn 
335 

ACA GGG 
Thr Gly 
350 

CTG TCC 
Leu Ser 



ACA ACG 
Thr Thr 



TTC CTC 
Phe Leu 



GGT CTT 
Gly Leu 



ACG AGA 
Thr Arg 



TCA CTG 
Ser Leu 



TAT CAA 
Tyr Gin 
385 

ACA TAC 
Thr Tyr 
400 



694 



742 



790 



838 



886 



934 



982 



1030 



1078 



1126 



1174 



1222 



1270 



1318 
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AGG ACT TTC 


TAGCAGTGCC 


GGCTCCCCCA 


CCTCCTGCAG 


CAGCAGCAGT 




1367 


Arg Thr Phe 

405 












ACAAGGGACT 


GGCTAGGATG 


GAGTCAGGCA 


GCTCACACTG 


GACCAGJFGTG 


GACCTTCCTT 


1427 


CCTCCCATGG 


CATGTGCAAG 


TAGGTCTGCG 


TGACCCCACT 


TCTJ^TGGTGC 


CGGCCTTACC 


148 / 


TCGTCTTCAT 


CCGTGGTGAG 


CAGCCTTCGT 


CAGTCTAGTT 


GTGTTGAAGC 


CAAGTGCAGT 


1547 


TGTGGATGTT 


GCTGGGGTAA 


TAAAGGCAAG 


CGGGCTCCAG 


AGCCTCTCTG 


GTGGCGGCCA 


1607 


AGCCACACTC 


CCTTAACTGG 


GAAGTACCTG 


CCACGT&GGG 


CATTTCTGCT 


GCCTATTTCC 


1667 


AGCCAGCGGC 


TGCATGGTTT 


GAAGTTCCTC 


CGTTGjTGGTC 


AGAAGAACTC 


TGGTGTTTGG 


1727 


TTCCCTGCTC 


AGCTGCGCGT 


GGACTGGGCT 


gac^ctcctca 


CCATACACTA 


GTGCCGGCTT 


1787 


TTGTTTCCTG 


TAAACAGTGG 


TTGCATGTGT 


^gagaagtaa 


CAAGCGAGTA 


TTCAGATCAT 


1847 


ACGAGGAGGC 


GTTCCTCGGT 


gcatgacggt/ 


CAGATGGCCA 


TTTATCAGCA 


TATTTATTTG 


1907 


TATTTTCTCA 


GCACATAGTA 


AGGTACAACT 


GTGTTTTCTC 


AATTGTCTCG 


AAAAAACAGA 


1967 


GTTCTTAAGT 


GGCCCAGTTG 


TGGAGOCAAG 


TCTAAGTCGT 


GTGGAGTCAG 


TGCTGACATC 


2027 


ACTGGCTTGT 


GCTGTCTGTC 


ACATSTGTTT 


GTCTCTGCTG 


CTTGACCTCA 


TGGGATGTAC 


2087 


CCTCCAGTTC 


AACTGCCCAA 


AAOAGACAGC 


CCCTTCCAAG 


CACCGTTCTT 


TGACAGCGGT 


2147 


AGCAGCTACC 


TATTCAAGAC 


GfCCTCACACA 


AAATCTGCCT 


TAGAAAGTTA 


ATATATTTTA 


2207 


AATTATTTTA 


AAAGAAACTC 


Aacatcttat 


TCTTTGGCCT 


TTCTTAATTG 


ATGCTTTATG 


2267 


GAGGCAGTGT 


TAACATTGTA 


CAGTGTATGC 


ATAGAGGAGT 


CTCCTCTATT 


TGAAGAACAA 


2327 


TGCAAAATGA 


GGCTTTCM'T 


GAAGGGAAAA 


AAAAAAAAAA 


AA 




2369 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(M LENGTH: 404 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Met Glu Ala Gly Glu Glu Pro Leu Leu Leu Ala Glu Leu Lys Pro Gly 
1/5 10 15 

Ara Pro His /Gin Phe Asp Trp Lys Ser Ser Cys Glu Thr Trp Ser Val 
20 25 30 



Ala Phe 

Val Val 
50 

Gly Phe 
65 

Gly Ser 

Leu Ala 

Arg His 

Gly Leu 
130 

Leu Leu 
145 

Phe Thr 

Thr Leu 

Leu Ser 

Cys Ser 
210 

Ser Met 
225 

Ser Val 
Ala Ser 
Arg Leu 



Ser Pro Asp 
35 

Lys Leu Val 



Glu Ala Lys 



Leu Lys Glu 
85 

Phe Ser Pro 
100 

His Pro Gin 
115 

Asn Asp Gly 



Leu Asn Leu 



Pro Ser Gly 
165 

Arg lie Trp 
180 

Gly His Lei 
195 

Met Leu ^ys 



Arg Ser Tyr 



Ser Asp 

290; 

Gly Le\ 
305 



Val per Cys 
245 

Ty/ Asp Thr 
260 

:g Ser Leu 
^75 

/al His Met 



Gly Ser 

Pro Trp 
55 

Ser Arg 
70 

Lys Thr 

Trp Pro 

Ala Pro 

Gin lie 
135 

Ser 
150 

Ser Zeu 

Bp Leu 

Gin Trp 

Ser Ala 
215 

Thr Leu 
230 

Asp Phe 
Ser Val 
His His 



Trp 
40 

Pro 



Ser 



Leu 



Ser 
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Phe Ala Trp Ser 



Gin >Gly His Cys 
4! 



Asp 
120 



Leu Glu Glu Gln y 

Ser Lys Asn/Asp 
7 

Asp Cys /Gly Gin 
9 

Pro ;pro Ser Arg 
105, 

val Ser Cys Leu 



Phe He Pro Lys 



Tyr Leu Ala 



Ser Ser 
295 

Thr Val 
310 



His 

He 

Asn 

Val 
200 

Ala 

He 

Ser 

He 

Thr 
280 

Leu 
Ala 



He Trp Glu Val 
140 

Gin Asp Val Val 
155 

Leu Val Ser Ala 
170 

Lys His Gly Lys 
185 

Tyr Cys Cys Ser 



Pro Lys Gly Arg 
80 

He Val Trp Gly 
95 

Lys Leu Trp Ala 
110 

He Leu Ala Thr 
125 

Gin Thr Gly Leu 



Gly Glu Lys Ser 
220 

Arg Lys Leu Glu 
235 

Pro Asp Ser Ala 
250 

Met Trp Asp Pro 
265 

Gin Leu Glu Pro 



Arg Asp Leu Ser 
160 

Ser Arg Asp Lys 
175 

Gin He Gin Val 
190 

He Ser Pro Asp 
205 

Val Phe Leu Trp 



Arg Ser Val Cys 
300 

Asp Asp Arg Leu 
315 



Gly His Gin Ser 
240 

Leu Leu Val Thr 
255 

Tyr Thr Gly Ala 
270 

Thr Met Asp Asp 
285 

Phe Ser Pro Glu 



Leu Arg He Trp 
320 
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Ala Leu Glu Leu Lys Ala Pro Val Ala Phe Ala Pro Met Th/Asn Gly 

325 330 / 335 

Leu Cys Cys Thr Phe Phe Pro His Gly Gly He He Al/ Thr Gly Thr 
340 345 / 350 

Arq Asp Gly His Val Gin Phe Trp Thr Ala Pro Arc^Val Leu Ser Ser 
355 360 / 365 

Leu Lys His Leu Cys Arg Lys Ala Leu Arg Ser yfche Leu Thr Thr Tyr 
370 375 / 380 

Gin Val Leu Ala Leu Pro He Pro Lys Lys lYf4t Lys Glu Phe Leu Thr 
385 390 /395 400 

Tyr Arg Thr Phe 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 1246 base ybairs 

(B) TYPE: nucleic ac: 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linea^ 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GACACTGCAT CGTCAAACTG ATJCCCCTGGC CGTTGGAGGA 
TTGAAGCCAA AAGCCGAAGT ^GCAAAAATG AGACGAAAGG 
AGACGCTGGA CTGTGGTCAG/ ATTGTCTGGG GGCTGGCCTT 
CAGCAGGAAG CTCTGGGCAC GCCACCACCC CCAAGTGCCC 
TGCTACGGGA CTCAACGM?G GGCAGATCAA GATCTGGGAG 
TTTGAATCTT TCCGGC</aCC AAGATGTCGT GAGAGATCTG 
TTTGATTTTG GTCTCQfGCGT CACGGGATAA GACTCTTCGC 
CGGTAAACAG ATTCA&VGTGT TATCGGGCCA CCTGCAGTGG 
CCCAGACTGC AGCATGCTGT GCTCTGCAGC TGGAGAGAAG 
GAGGTC CT AC ACGTTAATTC GGAAGCTAGA GGGCCATCAA 
CTTCTCCCCC GAGfTCTGCCC TGCTTGTCAC GGCTTCTTAC 
GGACCCCTAC AQCGGCGAAA GGCTGAGGTC ACTCCACCAC 



GCAGTTCATC 

GCGGGGCAGC 

CAGCCTGTGC 

GATGTCTCTT 

GTGCAGACAG 

AGCTTCACAC 

ATCTGGGACC 

GTTTACTGCT 

TCGGTCTTTC 

AGCAGTGTTG 

GATACCAATG 

ACCCAGGTTG 



CCTAAAGGGT 60 

CCAAAAGAGA 12 0 

TTTCCCCACC 180 

GCCTGGTTCT 2 40 

GGCTCCTGCT 3 00 

CCAGTGGCAG 3 60 

TGAATAAAC A 42 0 

GTTCCATCTC 480 

TATGGAGCAT 540 

TCTCTTGTGA 600 

TGATT ATGTG 660 

ACCCCGCCAT 72 0 



GGATGACAGT 
GTACCTTGCC 
TCCCATTGCA 
GAGTCATTGC 
TGTCCTCACT 
TCCTAGCACT 
CAACACCACA 
GGAATAATGG 
•AGAATGTAQC 



GACGTCCACA 
AC GGTGGC AG 
TTTGCTCCTA 
CACAGGGACA 
GAAGCACTTA 
GCCAATCCCC 
TCTTGTGCTT 
GCCAAACATC 
AAAACCAGAT 



TTAGCTCACT 
ATGACAGACT 
TGACCAATGG 
AGAGATGGCC 
TGCCGGAAAG 
AAGAAAATGA 
CTTTGTAGCA 
TGGTCTTGCA 
TCCAGTGTAC 
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GAGATCTGTG 

CCTCAGGATC 

GCTTTGCTGG 

ACGTCCAGTT 

CCCTTCGAAG 

AAGAGTTCCT 

GGGTAAATCG 

TTGAAATAG* 

TAGTCATSGA 



TGCTTCTCTC 
TGGGCCCTGG 
CACATTTT^T 
CTGGACAGCT 
TTTGCTAACA 
CiVCATACAGG 
rCCTGTCAAA 
ATTTCTTTGG 
TTTTTC 



C^jSAAGGCTT 
lCTGAAAAC 
CCACATGGTG 
CCTAGGGTCC 
ACTTACCAAG 
ACTTTTTAAG 
GGGAGTTGCT 
GATTGTGAAT 



(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 422 base paij/s 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : sing] 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 

ACCATGGTTC CAAGTCCTCT CCCCTCTGGT CAAGTTGCCC 

TTTTCCTCCT TGGGCCTCCC CTT^CTGACCT GCAGGACAGT 

TGAGGTATTA ATTAGCCTTA AGfTAAATTAC AGGGGACTCA 

TCCAGACACT ATTTTTTTTT TTTTTTTTTA ACAATGGTGT 

ATTTGTATGT CAGATTATAC/ AAGGATGTAT TCTTAAACCG 

CTGAGTTATC AGTGGCCAT/T TATTAGCATC ATATTTATTT 

AAGGTACAAC TGTGTTTTjTC TCGATTATCT AAAAAC CAT A 
AA 



GAATGTTGGG 
TTTCCGGAGC 
GAGGCCGTGC 
GCATGTGCAG 
CATGACTATT 
GTATTTTCTC 
GTACTTAAAT 



CCCAAGTGCC 
CCATTTGGTA 
TCCTGACCGA 
GAAATGACAA 
CAGATGGCTA 
AACAGATGTT 
TGAAAAAAAA 



780 
840 
900 
960 
1020 
1080 
1140 
1200 
1246 



60 
120 
180 
240 
300 
360 
420 
422 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2019 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



1 
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(D) TOPOLOGY: linear / 
(ii) MOLECULE TYPE: DNA / 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: / 

GGCACGAGGC GGGGTCAGGG CGGAGGCTGA GGACCAAGTA GGCATGGCGG AGGGCGGGAC 60 

CGGCCCCGAT GGACGGGCCG GCCCGGGACC CGCAGGTCCT AA^CTGAAGG AGTGGCTGAG 12 0 

GGAGCAGTTC TGTGACCATC CACTGGAGCA CTGTGACGAT ^CAAGACTCC ATGATGCAGC 180 

CTATGTAGGG GACCTCCAGA CCCTCAGGAA CCTACTGCa/ GAGGAGAGCT ACCGGAGCCG 240 

CATCAATGAG AAGTCTGTCT GGTGCTGCGG CTGGCTT0CC TGCACACCAC TGAGGATCGC 3 00 

AGCCACTGCA GGCCATGGGA ACTGTGTGGA CTTCCTCATA CGCAAAGGGG CCGAGGTGGA 360 

CCTGGTGGAT GTCAAGGGGC AGACTGCCCT GTA^GTGGCT GTAGTGAACG GGCACTTGGA 42 0 

GAGCACTGAG ATCCTTTTGG AAGCTGGTGC TGATCCCAAC GGCAGCCGGC ACCACCGCAG 480 

CACTCCTGTG TACCATGCCT YTCGTGTGGG /TAGGGACGAC ATCCTGAAGG CTCTTATCAG 540 

GTATGGGGCA GATGTTGATG TCAACCATOk TCTGAATTCT GACACCCGGC CCCCTTTTTC 600 

ACGGCGGCTA ACCTCCTTGG TGGTCTGTCC TCTATACATC AGTGCTGCCT ACCATAACCT 660 

TCAGTGCTTC AGGCTGCTCT TGCAQSCTGG GGCAAATCCT GACTTCAATT GCAATGGCCC 720 

TGTCAACACC CAGGAGTTCT ACAGteGGATC CCCTGGGTGT GTCATGGATG CTGTCCTGCG 780 

CCATGGCTGT GAAGCAGCCT TQGTGAGTCT GTTGGTAGAG TTTGGAGCCA ACCTGAACCT 840 

GGTGAAGTGG GAATCCCTGG jfiCCCAGAGGC AAGAGGCAGA AGAAAGATGG ATCCTGAGGC 900 

CTTGCAGGTC TTTAAAGAGGf CCAGAAGTAT TCCCAGGACC TTGCTGAGTT TGTGCCGGGT 9 60 

GGCTGTGAGA AGAGCTCt/g GCAAATACCG ACTGCATCTG GTTCCCTCGC TGCCGCTGCC 1020 

AGACCCCATA AAGAAGT/TTT TGCTTTATGA GTAGCATTCA CATGCAGTGC TGACTGCAAT 1080 

GTGGAAGCCG ATCACOTGCA GTGAAAACTG ACACAGACTC TGGCATCCTG GGAACCATGG 1140 

CCTGTGCTGC CAGCT3TGATC CTTGGCTGTC AGTGAAGAAA AAACGGCTGT GTTCTCTTGG 12 00 

ACTGTGATTC TATOTCAGGT GCTTGGGCCA TCGAACGCTC CTTGAGTCAT TGTCAACTGA 12 60 

GAGGCACATA CAAACTTAAT TTTGTTCCTC TTCAGTCTCT CTGTTTTGGA TTCTTCCTGG 1320 

CAATGTGTGC AGCATGGGCT GAGCCTGGTG ATTGCCCTAG TGGGGAAGGC TTTTTTCTCC 1380 

AGGCTATGCA TQTATTTATG TTCCTACTTT GCAATTTATT GTTCTTTTAA GGCTTGATAT 1440 

i 



CAAAACAGAA 
ACTTGCTAGC 
TTCTCTTTGC 
GTTCTATCTT 
AAGGGTTTTC 
ACCTGGAACT 
TCACTGCCAG 
GGAAGTTGGA 
CACTAARACC 
NCCCAATGCC 



AGAGGTTTGT 
CTGCTTTC C T 
TGCCACTGTT 
ATTGCACAGA 
TGATGTCTTA 
AACCATTAAG 
TCCTCACAAG 
AAATGTCTTG 
AGRATATCCT 
CNTTTGTKTN 



TAAGAAAAGA 
TGCCTGGGTT 
CTATTTTGGG 
GGTCCCAGAA 
CCCTGGGGAT 
TGACTGCCCA 
AGAAGATGAC 
GTCACCTGGG 
AGTTTTTTGG 
GGTTGCTTCC 
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TATAGGGAGA 
TGTCTGTCTA 
AGTTGTCTTC 
CAGTGTTCAT 
CTTCAGACAG 
CATTCAGATC 
ACGGGTGCTC 
TTGTTCCCAG, 
GTTGACTG1 
CTi 



AAGGAATTCC 
TGCTGCCTGG 
CGTCTAAG2 
AGGGCAQCAT 
TGGTTACCTT 
AGGGACCATC 
TOTTCAGACA 
'GCTACAACTT 
CCCTCCCCAC 



'CCGTGC 1500 

r GCACATCCC 1560 

GGCTTCTGGG 162 0 

CTGCTCTGCC 1680 

T AGG AG AC C C 1740 

TTAATAGTAC 1800 

CTCCCATACA 18 60 

CTTGGTGTTC 192 0 

TTTCCTTGAA 1980 
2019 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 50 amino ac/ds 

(B) TYPE: amino acid 

(C) STRANDEDNESS: sing/e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTK 

Ala Arg Gly Gly Val A/g 
1 5 

Glu Gly Gly Thr Gly/ Pro 
20 

Pro Asn Leu Lys G^u Trp 
35 

Glu His Cys Asp yksp Thr 
50 



SEQ ID NO: 
Ala Glu Ala 



Asp Gly Arg 
25 

Leu Arg Glu 
40 

Arg Leu His 
55 



Leu Gin Thr Leu Arg Asn Leu Leu Gin 
65 / 70 

lie Asn Glu Isfs Ser Val Trp Cys Cys 

85 



Glu Asp 
10 

Ala Gly 
Gin Phe 
Asp Ala 



Glu Glu 
75 

Gly Trp 
90 



Gin Val 
Pro Gly 



Cys Asp 
45 

Ala Tyr 
60 

Ser Tyr 



Leu Pro 



Leu Arg lie £la Ala Thr 
.00 



Ala Gly His 
105 



Gly Asn Cys Val 



Gly Met Ala 
15 

Pro Ala Gly 
30 

His Pro Leu 
Val Gly Asp 



Arg Ser Arg 
80 

Cys Thr Pro 
95 

Asp Phe Leu 
110 
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Ile Arg Lys 
115 

Ala Leu Tyr 
130 

Leu Leu Glu 
145 

Thr Pro Val 

Ala Leu lie 

Ser Asp Thr 
195 

Cys Pro Leu 
210 

Leu Leu Leu 
225 

Val Asn Thr 
Ala Val Leu 



Glu Phe Gly 
275 

Glu Ala Arg 
290 

Lys Glu Ala 
305 

Ala Val Arg 



Leu Pro Leu, 



Gly Ala Glu Val 



Val Ala Val Val 
135 

Ala Gly Ala Asp 
150 

Tyr His Ala Xaa 
165 

Arg Tyr Gly Ala 
180 

Arg Pro Pro Phe 



Asp Leu Val Asp Val 
120 

Asn Gly His Leu Gl) 

1> 

Pro Asn Gly Se^ Arg 
Ij 

Arg Val Gly/Arg Asp 
Ilk 

Val ,Asp Val Asn 
185/ 

:g Arg Leu Thr 



Asp 



Ser 
200 



Tyr lie Ser Ala 
215 

Gin Ala Gly Ala/Asn 
230 



Gin Glu Phe jyr Arg 
245 

Arg His G^ Cys Glu 
260 



Tyr His Asn Leu 
220 

Pro Asp Phe Asn 
235 

Gly Ser Pro Gly 
250 

Ala Ala Phe Val 
265 



Ala Asry Leu Asn Leu Val Lys Trp Glu 

280 



Gly l/rg Arg Lys Met 
295 

Argf Ser lie Pro Arg 
310 

:g Ala Leu Gly Lys 
325 

f Pro Asp Pro lie Lys 
340 



Asp Pro Glu Ala 
300 

Thr Leu Leu Ser 
315 

Tyr Arg Leu His 
330 

Lys Phe Leu Leu 
345 



Lys' Gly Gin Thr 



Ser Thr Glu lie 



His His Arg Ser 
160 

Asp lie Leu Lys 
175 

His His Leu Asn 
190 

Ser Leu Val Val 
205 

Gin Cys Phe Arg 



Cys Asn Gly Pro 
240 

Cys Val Met Asp 
255 

Ser Leu Leu Val 
270 

Ser Leu Gly Pro 
285 

Leu Gin Val Phe 



Leu Cys Arg Val 
320 

Leu Val Pro Ser 
335 

Tyr Glu 
350 



INFORMATION FOR SEQ ID NO : 2 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) /LENGTH: 419 base pairs 
(BWTYPE: nucleic acid 

(C) / STRANDEDNESS: single 

(D) / TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 
GCATCCATGG CGGAGGGCGG CAGCACGACG GGCGGGCAGG 
ATCTGAAGGA GTGGCTGAGG GAGCAATTTT GTGATCATCC 
CGAGGCTCCA TGATGCAGCT TACGTCGGGG ACCTCCAGAC 
AGGAGAGCTA CCGGAGCCGC ATCAACGAGA AGTCTGTCTG 
GCACACCGTT GCGAATCGCG GCCACTGCAG GCCATGGGAG 
GGAAGGGGGC CGAGGTGGAT CTGGTGGACG TAAAAGGACA 
TGGTGAACGG GCACCTAGAG AGTACCCAGA TCCTTCTCG^ 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



GCCGGGCTCC 


JGCAGGTCGTA 


60 


GCTGGAGCAC 


TGTGAGGACA 


120 


CCTCAGGAGC 


CTATTGCAAG 


180 


GTGCTGTGGC 


TGGCTCCCCT 


240 


CTGTGTGGAC 


TTCCTCATCC 


300 


GACGGCCCTG 


TATGTGGCTG 


360 


AGCTGGCGCG 


GACCCCAAC 


419 



(xi) SEQUENCE DESCRIPTION: /SEQ ID NO: 27 
GAGGAAGAAG AAAAGTGGAC CCTGAGGOCT TGCAGGTCTT 
CCAGAACCTT GCTGTGTCTG TGCCGTCfiTGG CTGTGAGAAG 
TCATCTGATT CCTTCGCTGC CTCTGCCAGA CCCCATAAAG 
GACTCCAAGT GCTGCGGTTG ATTCCAGTGA GGGAGAAAGT 
CCGAGCCCTG AGTGCTGTGC TGCTGCTGGT CTCCTGATGG 
CTCGTAGACT GTCATTGCTC CTGAGGTGCC TGGGCCGCTG 
CAGCTGAGAG GCTTATACTA AAGTTATTAT TGTTTTTCCC 
TCAGTTGCAT ATTAATGTAA CGGGCCATGG GGTATGTACA 
GCCTACTAAT TTCCTGTAGG GAAGACTCCC AGCACTTCTG 
TTTCTACTTC TCAATTTGAT JGGTTCGATTA AAGCCTTCTA 



TAAAGAGGCC 
AGCTCTTGGC 
AAGTTTCTAC 
GATCTGCAGG 
CTGTTGCTGC 
AACAGTCCTT 
AAGTTCTCTG 
TGTAGGGGCT 
GAACTGTGCT 
GTATCTCAAT 



AGAAGTGTTC 60 

AAAACCGGCT 12 0 

TCCATGAGTA 180 

GAGGTGGACA 240 

AGAAGATGTC 3 00 

GGGTCATTGT 3 60 

TTCTGGATTT 42 0 

GAGGTTGGAG 480 

TCTCTTTATT 540 

GAAAA 595 



(2) INFORMATION FOR S|EQ ID NO: 28 

(i) SEQUENCE CHi&RACTERISTICS: 
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(A) LENGTH: 896 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 4.. 396 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 

CTG ATG TCC GCA ATT CTG AAG GTT GGA CAC CAC TGC TGG CTG CCT GTG 
Met Ser Ala lie Leu Lys Val Gly His HrfLs Cys Trp Leu Pro Val 
1 5 /lO 15 

ACA TCC GCT GTC AAT CCC CAA AGG ATG CTG/AGG CCA CCA CCA ACC GCT 
Thr Ser Ala Val Asn Pro Gin Arg Met Len Arg Pro Pro Pro Thr Ala 

2 0 25 3 0 

GTT TTC AAC TGT GCC GCT TGC TGC TGT GfTG TGG GGG CAG ATG CTG ATG 
Val Phe Asn Cys Ala Ala Cys Cys Cys /Leu Trp Gly Gin Met Leu Met 
35 4(Y 45 

AAT ACA TAC CGT GTA GTT CAG CTT COT GAG GAG GCC AAG GGC TTG GTG 
Asn Thr Tyr Arg Val Val Gin Leu Pro Glu Glu Ala Lys Gly Leu Val 
50 55 / 60 

CCA CCA GAG ATT CTA CAG AAG TAC /CRT GGA TTC TAC TCT TCC CTC TTT 
Pro Pro Glu lie Leu Gin Lys Tyrf His Gly Phe Tyr Ser Ser Leu Phe 
65 70 / 75 

GCC TTG GTG AGG CAG CCC AGG TGFG CTG CAG CAT CTC TGC CGT TGT GCG 
Ala Leu Val Arg Gin Pro Arg S/er Leu Gin His Leu Cys Arg Cys Ala 
80 85 / 90 95 

CTC CGC AGT CAC CTG GAG GGC frGT CTG CCC CAT GCA CTA CCG CGC CTT 
Leu Arg Ser His Leu Glu Gly /Cys Leu Pro His Ala Leu Pro Arg Leu 

100 / 105 110 

CCC CTG CCA CCG CGC ATG CTC/ CGC TTT CTG CAG CTG GAC TTT GAG GAT 
Pro Leu Pro Pro Arg Met Leu Arg Phe Leu Gin Leu Asp Phe Glu Asp 



115 



120 



125 



CTG CTC TAC TAGGCTTGCT GCC^TGTGAA CAAAGCAGAC CCCACCCCCA 
Leu Leu Tyr 
130 

CCCCAAGGGC ATCTCTCAGC AATGlAATGAT GCAAGGCGGT CTGTCTTCAA GTCAGGAGTG 
GACGCCTTGA TCCACACTTG AGAGAAGAGG CCAGATCAGC ACCYGGCTGG TAGTGATNGC 



48 

96 

144 

192 

240 

288 

336 

384 

433 

493 
553 



- 146- 

AGAGGGCACC TGTGCAGATC TGTGTGCGCA CTGGAAATCT CTAGGCTGAA GGCYAGAGCA 613 

AATGGTGCAR GTGTTAGTCC TTGGGANGAG AGACAGANGG TTCAGAAAGCA AGACAGAGGT 673 

GAGAGTGCAC ATGTCAAGTG GTAGATTGCC TTAAAAGAAA/ GCTAAAAAAA GAAAAAGATT 733 

CGGGCGAACT TCTTTAGGGG TAATGCTGCA GCGTGTTA^A CTGACTGACC AGCGTCCATA 793 

TCTTTGGACC CTTCCCGGGT GAAAAAGCCC CTTCATC<?TC CAGCGCTCCC CAAGGGTGCT 853 

TAGCAATACC GGGTGCTTTT CTGCCGCAAA GTGAGTtfACC AAA 896 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS:, 

(A) LENGTH: 13 0 amino ^fcids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein, 

(xi) SEQUENCE DESCRIPTION: /SEQ ID NO:29: 

Met Ser Ala lie Leu Lys Val Gly/His His Cys Trp 
1 5 / 10 

Ser Ala Val Asn Pro Gin Arg Meft Leu Arg Pro Pro 
20 / 25 

Phe Asn Cys Ala Ala Cys Cys fcys Leu Trp Gly Gin 
35 / 40 



Leu Pro Val Thr 
15 

Pro Thr Ala Val 
30 

Met Leu Met Asn 
45 



Thr Tyr Arg Val Val Gin Leuf Pro Glu Glu Ala Lys Gly Leu Val Pro 
50 55 60 



Pro Glu lie Leu Gin Lys Tyr His Gly Phe Tyr Ser 
65 70 / 75 

Leu Val Arg Gin Pro Arg £er Leu Gin His Leu Cys 

85 / 90 

Arg Ser His Leu Glu Gly/Cys Leu Pro His Ala Leu 
100 / 105 

Leu Pro Pro Arg Met Led Arg Phe Leu Gin Leu Asp 
115 120 



Ser Leu Phe Ala 
80 

Arg Cys Ala Leu 
95 

Pro Arg Leu Pro 
110 

Phe Glu Asp Leu 
125 



Leu Tyr 
130 



147 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

I 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 
GTGGGGGCGT CATCATGACC TCCTCTAGGG ' CTCTGCAACA 
AACAAATTGT TCACTGATGA ATCCACAAGG ATCTCTGGGC 
ACATGACTGT CGTCTTCGGA GAAGGCACCA CTCGCCCQCG 
CATGGGAGAA GACGTATCCA GGCAGCAGCT GCGCGGGCCT 
ATCTAAAGGC ACGGTGTACT GAAGGTAGTC CTGAGACATG 
GTGTTCCTCC AGGTGGAGGC TCAGGTCCCC GGGTCBAGCTG 
GCGCGGCTCT GGCTGCAGGT CTCGCAGCTC CCTJSGGCTGT 
CACACCGTTG ACTGGT 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2180 base pairs 

(B) TYPE: nucleic aci< 

(C) STRANDEDNESS: sir^le 

(D) TOPOLOGY: linear; 

(ii) MOLECULE TYPE: DNA 



'GACTCCTGT 
CTACAACCAG 
GCAGGTACGG 
TCAAGAGGGC 
AGTCCGATTA 
GGGCTGCAGC 
AGCTCCCGCA 



GGTGCAAATC 
GTCCTGGTCC 
CTGACACCTC 
ACATCCCGTC 
CTACAGGCAC 
GGGACTCAGG 
GATCCTTGCG 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 

TTAATAGTAC C T AC AT AGT A GAAA^TTATA ACTCCACTTT 

TTCAAATCAA TTTAAAACTT TTT/^f AAACA TTAATGTTGC 

TGAAAATTAG TTGACAATCA AGTTCACCCA AGAAAATGTT 

AGATAAAACA TTTTACCAAA AGGATAGGTA ACACACAAAA 

ATGATCATCT AATATTTCTT TMVTAATAAT TCTAGTTCCA 

ATTTGTACCC GAGTTTAATT ACAGAAAAGG CAACAATTTC 

CTTTACAATT TTTTAATGTA AGGCCATTTA TTAAAATAGA 



AAAACAATGT 
AAGAGAATCC 
GACTAAGCTA 
AAATGCTATC 
TAGGTTTTCA 
TAAATTGGTG 
CAAACTAGAA 



TTTCTTTCTA 
AGTCCATTTA 
AAGAAATCAC 
ACAGGAAGCT 
TGTTATGCCA 
GTATACATTT 
GATGAAAACG 



AAGGCAACAG 
TTTAGAAAAA 
TGTCAATTCA 
TGTATATGTA 
ATGGACCATT 
TAGGGGTTGA 
AGCAAAAGGG 
TTCTGCACAT 
AGAACCAAAA 
ACTCTTTTAC 
GTTTGTAAAC 
CAATCTTTCA 
TCTGAATTAA 
AGCGCACCTA 
GCAGCGAGAC 
ATTAGATCAA 
CATTTGAGTG 
AAAGGTAACG 
GCTTCCCTTC 
TTTTCAGCTC 
AAAGTCTTGC 
TCAGCCACGT 
TTATTCCTGC 
TCTCCGGGGT 
ATCCACGAAG 
GAAAGCGCGC 
CAGACAATAG 
CTGAATGTAG 



AAAAATTCAA 
AGTGTTGTTA 
CGATTCTAAA 
AATTCCGTGG 
TAAAAGGACA 
TTACCACATG 
ACTGGGCGGG 
TGATGAAACA 
TAAACCCAAG 
ACATAAAACT 
TGTATTTCAA 
GTAGTGCTTC 
GTCTATTCTG 
TGAACCCCGG 
CTTGAATAAC 
CTATGGACGT 
CTCAATTCTA 
GTCGTCAGAA 
TGCCTCCCAA 
CTC TGTAAGG 
AACCCCAGGA 
GGGGCTTTCT 
ATTGGAGTAA 
TCTGCAACAT 
ATCTCTGGGG 
CCTCCCACTT 
CTCCGTGATC 
TCCTGAGGCA 



CTTTTCACAA 
AAAGATATGT 
TAAATCTTTT 
TTTTATCACA 
TGGCAATTTC 
AAATGTGCTT 
GGGGGCATTG 
TAATTCACAC 
ACACCTTGCT 
GAAATAGTTA 
TCTCTTGTTC 
TCCTGTAAAT 
GTATACTGAC 
AACACTGGTT 
AAAAAGCTCC 
ATGTCCTTCC 
GTGTGAAGTG 
CTGTCCCGAA 
CGTGTGATTG 
CTTGTCACAA 
GCAGAGTTCG 
GTCCAGTGAG 
CCAATGGTG 
GACTCCCGTj 
CGACAACmG 
GAGGAGGi 
CTTCCAAAJGG 
TAAGTCC3AT 



- 148- 

CCAAAAGAAT 
TGCAGATCTC 
TAAAGTAAGA 
CAGGTATGTT 
CATTCTGTTA 
TTAATGCATA 
AGGAGAATTT 
CTCTAAAACC 
GACACTTCCC 
TGGCAGCi 
TTATTCCCi 
AATCCTTQAT 
GTATAAC 

?TCT 
CATTT/TCAGA 
ACATCGGCTG 
TTTTACCATG 
CAAGAAAAGA 
GTCCCCAGTA 
jfCATGGGACC 
jATCAAAATT 
TCCACTGAAA 
AGATTGGAGG 
GTGCCAATCA 
GTCCTGGTCT 
CGCAGAGACT 
ATACATCCCC 
AACGACAGGC 



GGCAAGfl 



TAGCACAACC 
CGTTCCAT5 
GATTAAA^AAC 
TATTC|6vCAC 
AGTTTCATTC 
lTCACAG 
GAft'AATTCAC 
TyCAAGACTTC 
lACCCCTAAA 
AGATTTTGAT 
AGTGCAAGAT 
TTTGTTTGGC 
ACGACACAGG 
GACGGAAGTG 
GTCCCTGATT 
TTCATAAAAG 
GGAGCGAAAG 
ACCATCTGGC 
CCATCCTTGC 
AC T AC TTTGC 
CAAATGACAG 
GTTCCCCTTT 
GACATCCATC 
ACAAGCCATT 
ACCTGACTCT 
TCCATGGGAG 
TCATCTAAAG 
ACATGTTCAT 



TTAGAAATAA 480 

CCCAAGATTA 540 

TCATCTTCAG 600 

TGCTTTGGAA 660 

AACCTTTACT 72 0 

TGGATTAGCC 780 

ATTGTGATTA 840 

CCTTTTTTAA 9 00 

CAAACTGATG 9 60 

GGCAATGAAA 102 0 

GCAGGGTTCT 1080 

AAAGGC AGTT 1140 

T ACTGCAACG 12 00 

C AGATTCC AG 12 60 

GAATGCTCCA 132 0 

CTAAACCTAC 13 8 0 

TC AC AGCTTA 1440 

ACGTTTGCTA 1500 

TTTGCAAGTT 1560 

ACTGAGTC AT 162 0 

CGCATAACTT 168 0 

GGGATTTGGA 1740 

GTGAACCCGC 1800 

CACCGGACTG 1860 

CATCCTCGGG 192 0 

AAGAGCTGTC 198 0 

GC AC AGTAT A 2 040 

CCAGGTGAAG 2100 



149- 

ATGCAGGTCT CCATTATGAG AAGCCGAGCT CTTCAGTGAA TTGGCTTQCT CCTGGCACGT 2160 
GGTCTCAGAC TGGAGGTCGT / 2180 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 649 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 



GGCACGAGGC 


TGTGTCCAGC 


ACACAGAGAG 


GGCCOGGCCA 


TCTGCTTTGG 


TTCAGAGCCC 


60 


TGTGTCTGTC 


TGTCACTTAG 


ACTCTTCCTC 


CCGGCTCGCA 


GCTCACCCTC 


CATCCTCCTT 


120 


ACTGGCTCCA 


GCATGACTCG 


CTTCTCTTAT 


GCM3AGTACT 


TTGCTCTGTT 


TCACTCTGGC 


180 


TCTGCACCTT 


CCAGGTCCCC 


TTCGTCTCCC 


GM3AACCCAC 


CGGCCCGCGC 


ACCCCTGGGT 


240 


CTGTTCCAAG 


GGGTCATGCA 


GAAGTATAGC 


M5CAACCTGT 


TCAAGACCTC 


CCAGATGGCG 


300 


GCTATGGACC 


CCGTGCTGAA 


GGCCATCAAG 


/GAAGGGGATG 


AAGAGGCCTT 


GAAGATCATG 


360 


ATCCAGGATG 


GGAAGAATCT 


TGCAGAGCCQ 


AACAAGGAGG 


GCTGGCTGCC 


GCTCCACGAG 


420 


GCTGCCTACT 


ATGGCCAGCT 


GGGCTGCCTG 


AAAGTCCTGC 


AGCAAGCCTA 


CCCAGGGACC 


480 


ATTGACCAAC 


GCACACTGCA 


GGAAGAG^A 


GCATTATACC 


TGGCCACATG 


CAGAGAACAC 


540 


CTGGATTGCC 


TCCTGTCGCT 


GCTCCAGGCG 


GGGGCAGAGC 


CTGACATCTC 


TAACAAATCC 


600 


AGGGAGACTC 


CACTTTACAA 


AGCCTGTGAG 


CGCAAGAACG 


CGGAGGCGGT 


GAGGATATTG 


660 


GTGCGATACA 


ACGCAGACGC 


CAACCACCGC 


TGTAACAGGG 


GCTGGACCGC 


ACTGCACGAG 


720 


TCTGTCTCCC 


GCAATGACCT 


GGAGGjTCATG 


GAGATCCTAG 


TGAGTGGCGG 


GGCCAAGGTG 


780 


GAGGCCAAGA 


ATGTCTACAG 


CATCACCCCT 


TTGTTTGTGG 


CTGCCCAGAG 


TGGGCAGCTG 


840 


GAGGCCCTGA 


GGTTCCTGGC 


CAAGCATGGT 


GCAGACATCA 


ACACGCAGGC 


CAGTGACAGT 


900 


GCATCAGCCC 


TCTACGAGGC 


CAGCIAAGAAT 


GAGCATGAAG 


ACGTGGTAGA 


GTTTCTTCTC 


960 


TCTCAGGGCG 


CCGATGCTAA 


CAAAGCCAAC 


AAGGACGGCC 


TGCTCCCCCT 


GCATGTTGCC 


1020 


TCCAAGAAGG 


GCAACTATAG 


AATAGTGCAG 


ATGCTGCTGC 


CTGTGACCAG 


CCGCACGCGC 


1080 



- 150- 



GTGCGCCGTA GCGGCATCAG CCCGCTGCAC CTAGCGGCCG AGCGCAACCA/CGACGCGGTG 1140 

CTGGAGGCGC TGCTGGCCGC GCGCTTCGAC GTGAACGCAC CTCTGGCTCZC CGAGCGCGCC 12 00 

CGCCTCTACG AGGACCGCCG CAGTTCTGCG CTCTACTTCG CTGTGGTCAA CAACAATGTG 12 60 

TACGCCACCG AGCTGTTGCT GCTGGCGGGC GCGGACCCCA ACCGCGATGT CATCAGCCCT 132 0 

CTGCTCGTGG CCATCCGCCA CGGCTGCCTG CGCACCATGC AGOTGCTGTT GGACCATGGC 13 80 

GCCAACATCG ACGCCTACAT CGCCACTCAC CCCACCGCCT TTCCAGCCAC CATCATGTTT 1440 

GCCATGAAGT GCCTGTCGTT ACTCAAGTTC CTTATGGACC /TCGGCTGCGA TGGCGAGCCC 1500 

TGCTTCTCCT GCCTGTACGG CAACGGGCCG CACCACCCGC? CCCGCGACCT GGCCGCTTCC 1560 

ACGACGCACC CGTGGACGAC AAGGCACCTA GCGTGGTGjCA GTTCTGTGAG TTCCTGTCGG 162 0 

CCCCGGAAGT GAGCCGCTGG GCGGGACCCA TCATCGATGT CCTCCTGGAC TATGTGGGCA 1680 

ACGTGCAGCT GTGCTCCCGG CTGAAGGAGC ACATC^ACAG CTTTGAGGAC TGGGCTGTCA 1740 

TCAAGGAGAA GGCAGAACCT CCGAGACCTC TGGCTCACCT CTGCCGGCTG CGGGTTCGGA 1800 

AGGCCATAGG AAAATACCGG ATAAAACTCC TGGACACACT GCCGCTTCCC GGCAGGCTAA 1860 

TCAGATACTT GAAATATGAG AATACACAGT A^CCAGCCTG GAGAGGAGAT GTGGCCTTCA 192 0 

GACTGTTTCC GGGACGCCCC AGGTGGCCTG CATCCAGGAC CCCCTGGGGT CAGAACAGGT 1980 

GTGACCTTGC TGGTTCTTTG CTGGAGCTTC AcCCAAAGTG AGAACCTGAT GTGGGGAGTG 2 040 

GACGTGGAAC CTCTGCTTTC ACACTGTCAGf CGGATCGCAG ACCCGCTCTG CTTCTGGCCA 2100 

TAGCCAGAGA CCTTCAACCT GGGGCCAGQG GAGAGCTGGT CTGGGCAAGG TGGCCCAGGC 2160 

AGGAATCCTG GCCTTAAGCT GGAGAACTiTG TAGGAATCCC TCACTGGACC CTCAGCTTTC 222 0 

AGGCTGCGAG GGAGACGCCC AGCCCAAGTA TTTTATTTCC GTGACACAAT AAC GTTGT AT 2280 

CAGAAAAAAA AAAAAACATG GGCGCAGCTT ATTCCTTAGT AGGGTATTTA CTTGCATGCG 2340 

CGCTTAAAGC TACTGGAAAC ATGCGTTTCCA CTATGCTTGA GAATCCCCTT GCACTGGTAA 2400 

ACGAGAGCCG ACGTGCTTCA AGGTTGGATT TTTGGTTGCC CCTTTGGCGT TCCGCGGGTT 2460 

TGTCCGACGT AATTGACCCC GTGTT7TTGTC ACTTTCGAGT GTTCCGACTA TTGGGGGGCT 2 52 0 

TTTGGTTGTC CCCAAAATTG TGGGTGGTGT GCGGACGCCA CGAGAAGTGG TTCATGGGCG 2580 

ATAATCATTA CTGGAGAATG TAGAGCGGCG GTTTTACGAA TAAATATTTT TTAAGCCGCC 2 640 

TTCCCAAAA 2 649 



7 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 495 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingl e 

(D) TOPOLOGY: linear 

( i i ) MOLECULE TYPE : DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
CCTCCTGAGA GTTCGCCGGC CCGGGCCCAA TGGGTTG^TC 
CAGCAGCAGC TTGTTCAAGA CCTCCCAGCT GGCGCOTGCG 
CAAGGATGCG ATGAAGAGGC CTTGAAGACC ATGATCAAGG 
CCCAACAAGG AGGGCTGGCT GCCGCTGCAC GAGGCCGCAT 
CTGAAAGTCC TGCAGCGAGC GTACCCAGGG AGCATCGACC 
ACAGCCGTTT ACTTGGCAAC GTGCAGGGGC ^ACCTGGACT 
GCAGGGGCAG AGCGGGACAT CTCCAACAAa/tCCCGAGAGA 
AGCGCAAGAA CGCGGAAGCC GTGAAGATTjC TTGGTGCAGC 
GCTGCAACCG GGCTG 

(2) INFORMATION FOR SEQ ID N<6:34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 709 base pairs 

(B) TYPE: nuclei/c acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: l/inear 

(ii) MOLECULE TYPE: /DNA 




CAAGGGGTCA 
GACCCCTTGA 
AAGGGAAGAA 
ACTATGGCCA 
AGCGCACCCT 
GTCTCCTGTC 
ACCGCTCTAC 
ACAACGCAGA 



TGC AGAAATA 6 0 

TAAAGGCC AT 12 0 

TCTCGCAGAG 180 

GGTGGGCTGC 240 

GC AGGAGGAA 3 00 

ACTGCTCCAA 3 60 

AAAGCCTGTG 42 0 

CACCAACAAC 480 

495 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
GTGCAGCTCT GCTCGCGGCy GAAGGAACAC ATCGACAGCT TTGAGGACTG GGCCGTCATC 
AAGGAGAAGG CAGAACCTCC AAGACCTCTG GCTCACCTTT GCCGACTGCG GGTTCGAAAG 
GCCATTGGGA AATACCGTAT AAAACTCCTA GACACCTTGC CGCTCCCAGG CAGGCTGATT 
AGATACCTGA AATACGAGAA CACCCAGTAA CTGGGGCCAC GGGGAGAGAG GAGTAGCCCC 
TCAGACTCTT CTTACTAAGT CTCAGGACGT CGGTGTTCCC AACTCCAAGG GGACCTGGTG 



60 
120 
180 
240 
300 



ACAGACGAGG 
TCTCAGGGCC 
TTGTTTACAA 
TCTATTCCTG 
TCCCTGTGCC 
AAAGACTAAG 
GCTTTGTTAA 



CTGCAGGCTG 
CAGAGCTTTG 
ACTGATGAGC 
GGGCCAGGGC 
CCTCCACTTG 
ATGAAGACGT 
GTATTAATAT 



CCTCCCTCTC 
GCCAGAGCAG 
AGATCCCAGA 
AGAGCTTGAG 
TTCTGGAAAA 
GGCCCAAGGT 
AATAAATGTT 



AGCCTGGACA GCTACCAGGA 
AGAACAGAAT GTGTCAAGGi 
CCTTCTCTAC CTTCAGG^AT 
GTGTTCTGGG GAAGGTCGTG 
CTCACCACTT GAC^TCAGAG 
AGGGGGTAGG G^GAGCCTGG 
AC AC ATGTGA . 



TCACTGGG 
GAAGAATCAT 
GGCAGAAACC 
CTCAGAGCCT 
CTTTCTCTCC 
GTCTTGGAGG 



360 

420 

480 

540' 

600 

660 

709 



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 848 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1. . 62< 



(xi) SEQUENCE DESCRIPTIC 



SEQ ID NO: 35: 



TTG GAG AAG TGT GGT TGG TAT/ TGG GGG CCA ATG AAT TGG GAA GAT GCA 
Leu Glu Lys Cys Gly Trp Tyr Trp Gly Pro Met Asn Trp Glu Asp Ala 
1 5/10 15 



48 



GAG ATG AAG CTG AAA GGG 
Glu Met Lys Leu Lys Gly 
20 



CCA GAT GGT TCT TTC CTG GTA CGA GAC 
Pro Asp Gly Ser Phe Leu Val Arg Asp 
25 30 



96 



AGT TCT GAT CCT CGT TAC ATC CTG AGC CTC AGT TTC CGA TCA CAG GGT 
Ser Ser Asp Pro Arg Tyr /lie Leu Ser Leu Ser Phe Arg Ser Gin Gly 
35 / 40 45 



144 



ATC ACC CAC CAC ACT AGA/ ATG GAG CAC TAC AGA GGA ACC TTC AGC CTG 
lie Thr His His Thr Arg Met Glu His Tyr Arg Gly Thr Phe Ser Leu 
50 I 55 60 



192 



TGG TGT CAT CCC AAG TTtf GAG GAC CGC TGT CAA TCT GTT GTA GAG TTT 
Trp Cys His Pro Lys Phe Glu Asp Arg Cys Gin Ser Val Val Glu Phe 
65 73 75 80 



240 



- 153 - 



ATT AAG AGA GCC ATT ATG CAC TCC AAG AAT GGA AAG/TTT CTC TAT TTC 
lie Lys Arg Ala lie Met His Ser Lys Asn Gly Lys Phe Leu Tyr Phe 

85 90 / 95 

TTA AGA TCC AGG GTT CCA GGA CTG CCA CCA ACT CCT GTC CAG CTG CTC 
Leu Arg Ser Arg Val Pro Gly Leu Pro Pro Thr Pro Val Gin Leu Leu 
100 105 / 110 



288 



336 



TAT CCA GTG TCC CGA TTC AGC AAT GTC 
Tyr Pro Val Ser Arg Phe Ser Asn Val 
115 120 



TCC CTC CAG CAC CTT TGC 
rs Ser Leu Gin His Leu Cys 
125 



384 



480 



528 



576 



AGA TTC CGG ATA CGA CAG CTC GTC AG(2 ATA GAT CAC ATC CCA GAT CTC 432 
Arg Phe Arg lie Arg Gin Leu Val Arg lie Asp His lie Pro Asp Leu 
130 135 / 140 

CCA CTG CCT AAA CCT CTG ATC TCT TAT ATC CGA AAG TTC TAC TAC TAT 
Pro Leu Pro Lys Pro Leu lie Sjdr Tyr lie Arg Lys Phe Tyr Tyr Tyr 
145 150 / 155 160 

GAT CCT CAG GAA GAG GTA TAy CTG TCT CTA AAG GAA GCG CAG CGT CAG 
Asp Pro Gin Glu Glu Val Tyx Leu Ser Leu Lys Glu Ala Gin Arg Gin 

165 / 170 175 

TTT CCA AAC AGA . AGC AAG/ AGG TGG AAC CCT CCA CGT AGC GAG GGG CTC 
Phe Pro Asn Arg Ser Lys Arg Trp Asn Pro Pro Arg Ser Glu Gly Leu 
180 7 185 190 

CCT GCT GGT CAC CAC /CAA GGG CAT TTG GTT GCC AAG CTC CAG CTT TGAAGAACCA 
631 

Pro Ala Gly His Kid Gin Gly His Leu Val Ala Lys Leu Gin Leu 
195 / 200 205 

AATTAAGCTA CCATGAAAAG AAGAGGAAAA GTGAGGGAAC AGGAAGGTTG GGATTCTCTG 691 

TGCAGAGACT TTGGTtfCCCC ACGCAAGCCC TGGGGCTTGG AAGAAGCACA TGACCGTACT 751 

C TGC GTGGGG CTCCACCTCA CACCCACCCC TGGGCATCTT AGGACTGGAG GGGCTCCTTG 811 

GAAAACTGGA AGAAGTCTCA ACACTGTTTC TTTTTCA 848 



(2) INFORMATION FOR SEQ ID NO : 3 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 07 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) 

(xi) 

Leu Glu Lys; 
1 



OLECULE TYPE: protein 

SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Cys Gly Trp Tyr Trp Gly Pro Met Asn Trp Glu Asp Ala 
5 10 15 



- 154 



Glu Met Lys Leu 
20 

Ser Ser Asp Pro 
35 

lie Thr His His 
50 

Trp Cys His Pro 
65 

lie Lys Arg Ala 



Leu Arg Ser Arg 
100 

Tyr Pro Val Ser 
115 

Arg Phe Arg lie 
130 

Pro Leu Pro Lys 
145 

Asp Pro Gin Glu 



Phe Pro Asn Arg 
180 

Pro Ala Gly His 
195 



Lys Gly Lys 

Arg Tyr lie 

Thr Arg Met 
55 

Lys Phe Glu 
70 

lie Met His 

85 

Val Pro Gly 

Arg Phe Ser 

Arg Gin Leu 
135 

Pro Leu lie 
150 

Glu Val Tyr 
165 

Ser Lys Arg 
His Gin Gly 



Pro Asp Gly Ser Phe Leu 
25 

Leu Ser Leu Ser Phe Arg/ 
40 4' 

Glu His Tyr Arg Gly 'JTnr 

60 

Asp Arg Cys Gin Se/ Val 
75 

Ser Lys Asn Gly £ys Phe 
90 

Leu Pro Pro Th^ Pro Val 
105 

Asn Val Lys £er Leu Gin 
120 / 125 

Val Arg xy& Asp His lie 

140 

Ser Tyr /lie Arg Lys Phe 
155 

Leu S^r Leu Lys Glu Ala 
170 

Trp /ksn Pro Pro Arg Ser 
85 

His£ Leu Val Ala Lys Leu 
20/0 205 



VaA Arg Asp 

yBO 

Ser Gin Gly 

Phe Ser Leu 

Val Glu Phe 
80 

Leu Tyr Phe 
95 

Gin Leu Leu 
110 

His Leu Cys 
Pro Asp Leu 



Tyr Tyr Tyr 
160 

Gin Arg Gin 
175 

Glu Gly Leu 
190 

Gin Leu 



(2) INFORMATION FOR SEQ ID N</:37 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 464 bise pairs 

(B) TYPE: nucleic/ acid 

(C) STRANDEDNESS / single 

( D ) TOPOLOGY : 1 i^ear 

(ii) MOLECULE TYPE: Uh 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 
GTTCCAAGCC TAACCCATCT TTGTCGTTTG GAAATTCGGG CCAGTCTAAA AGCAGAGCAC 
CTTCACTCTG ACATTTTCAT CCATCAGTTG CCACTTCCCA GAAGTCTGCA GAACTATTTG 
CTCTATGAAG AGGTTTTAAG AATGAATGAG ATTCTAGAAC CAGCAGCTAA TCAGGATGGA 



60 
12 0 
180 



- 155 

GAAACCAGCA AGGCCACCTG ACACAGGTCC TTTAATTCTG TTTAG^CACA AAAGACGGCT 
TGTGTGACTG TTTGGATTTG GTGATCAAAT GTCCATGTTT ACAGTTGCTT TTCCCAGTTT 
GTGTCTTTCC CAATATTGTG AACCTTATCC ATCTTGCCTT A|cTCAGTTTT ATTTCTAGTG 
CACTTTGTTG TGTATTATTT GTTTACCTGA CCATTTTCTA/CTTTATTCTG CTAATAAACT 
GTAATTCTGA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAA 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 747 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



240 
300 
360 
420 
464 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38 

GGGGATCGAA AGCGGGGGCT TCTGGGACGC AGCTCTGGAG 

TTTCGGTGTA GAAGTGGCAG CACGGCAGAC TGGTCAAACA 

ACGCGGACAC GTGCTCTACA GTTGGAQTTG CTGCCAGGGA 

GGAAACTGCT CAAAAAGGGC CGAAGTJGTCG ATGTTGCTGA 

TTCATGAAGC AGCTTATCAC AACTC2TGTAG AATGTTTGCA 

CATCTGAAAA CTACATTAAG ATGAAGACCT TTGAAGGTTT 

CAAGTCAAGG ACATTGGAAA ATCGTACAGA TTCTTTTAGA 

CAACTACTTT AGAAGAAACG ACAtCCATTGT TTTTAGCTGT 

TGTTAAGGCT GTTGCTTCAA CACGGAGCAA ATGTTAATGG 

GGAACTCCTT GCACCAGGCT TCTTTTCAGG AAAATGCTGA 

GAAAAGGAGC AAACAAGGAA TfGCCAGGATG AC TTTGG AAT 

CTCAGTATGG CCAAGCTAGA jAAGCTTTGAA GCATACTTAT 

AATTGTCAAG CCTTGGACAA / AGCTACC 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1018 base pairs 

(B) TYPE: hucleic acid 



ACGCGGCCTC 
AATGGATTTT 
AGGCAATGTT 
TAACAGGGGA 
AATGTTAATT 
CTGTGCTTTG 
AGCTGGGGCA 
TGAAAATGGA 
ATCCCATTCT 
GATCATAAAA 
CACACCTTTA 
TTCATCCGGG 



GGACCAGCCA 60 

AC AGAGGCTT 12 0 

AAAGTCTTAA 180 

TGGATGCCAA 240 

AATGCAGATT 3 00 

CATCTCGCTG 3 60 

GATCCTAATG 42 0 

CAGATAGATG 480 

ATGTGTGGAT 540 

TTGCTTCTTA 600 

TTTGTGGCTG 660 

TGCAAATGTC 72 0 

747 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 

CACAAATGGG AC CAT AC AAA AATCTTGGAC TTGTTAA^AA CCACTTACTA ACCGGGACCT 60 

GTGACACTGG GCTAAACAAA GTAAGTCCCT GTTTAQtCAG CAGTGTTTGG GGGACATGAA 12 0 

GGATTGCCTA GAAATATTAC TCCGGAATGG TCTA^AGCCC AGACGCCCAG GCGTGCCTTG 180 

TTTTTGGATT CAGTTCTCCT GTGTGCATGG CTT^CCAAAA GGAGGTGGAG CTGTAGTTCT 240 

TTGGAATTGT GAACATTCTT TTGAAATATG GAJ&CCCAGAT AAATGAACTT CATTTGGCAT 3 00 

ACTGCCTGAA GTACGAGAAG TTTTCGATAT T^TCGCTACTT TTTGAGGAAA GGTTGCTCAT 3 60 

TGGGACCATG GAACCATATA TATGAATTTG /TAAATCATGC AATTAAAGCA CAAGCAAAAT 42 0 

ATAAGGAGTG GTTGCCACAT CTTCTGGTTQf CTGGATTTGA CCCACTGATT CTACTGTGCA 480 

ATTCTTGGAT TGACTCAGTC AGCATTGA0A CCCTTATCTT CACTTTGGAG TTTACTAATT 540 

GGAAGACACT TGCACCAGCT GTTGAAA^GA TGCTCTCTGC TCGTGCCTCA AACGCTTGGA 600 

TTCTACAGCA ACATATTGCC CACTGTTCCA TCCCTGACCC ATCTTTGTCG TTTGGAAATT 660 

CGGTCCAGTC TAAAATCAGA ACGTCTACGG TCTGACAGTT ATATTAGTCA GCTGCCACTT 72 0 

CCCAGAAGCC TACATAATTA TTTGQfTCTAT GAAGACGTTC TGAGGATGTA TGAAGTTCCA 780 

GAACTGGCAG CTATTCAAGA TGG^TAAATC AGTGAAACTA CTTAACACAG CTAATTTTTT 840 

TCTCTGAAAA ATCATCGAGA CAA^AGAGCC ACAGAGTACA AGTTTTTATG ATTTTATAGT 9 00 

CAAAAGATGA TTATTGATTG TC^GATAGGT TAGGTTTTGG GGGGCCAGTA GTTCAGTGAG 960 

AATGTTTATG TTTACAACTA GtfCTTCCCAG TAAAAAAAAA AAAAAAAAAA AAAAAAAA 1018 



(2) INFORMATION FOR SE& ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: / 1897 base pairs 

(B) TYPE: nJucleic acid 

(C) STRANDHDNESS: single 

(D) TOPOLOCJY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40 
CGGGGGGCTG GGACCTGGGG CGTAACCGTC TCTACCACGA 
AAACATACCC AGCCTTTCTG GAGCCGGACG AGACATTCAT 
TGGCCCTGGA CATGRATGAT GGGACCTTAA GTTTCATCGT 
TGGCTTTCCG GGGACTCAAG GGTAAAAAGC TGTATCCTGT 
ACTGTGAGAT CCGCATGCGC TACTTGAACG GACTTGATQE 
ACCTGTGCCG GCGTTCGGTG CGCCTAGCGC TGGGAAA^GA 
CTCTGCCGCT ACCTGCCTCC CTCAAAGCCT ACCTCCTCTA 
ACCGCCATAC GACAGCCATC TGGTGCCAAR TCACJGAGCC 
TGCGCCTGGG ATGGAAGCCC ACCTCAGCCA TGGGCAGACG 
TGCCTCTGCT GGGGGAACCT ATGCCAACGG AOTTCTCCCT 
GCAGCACCCA GGCCCTTCCC TGAACCAGAT yGCAGAGAATA 
GGCGCCTTCT GCTCTCAGGT GGAGTGGGCj/ GCCCCCCACT 
CCCACCTGGG GGGTCCTGGG AGGTAAGACT AGTAGGAGGT 
CAGGAATGGC CAGGAMCAGG CCATAC5K2AT GAAGCTCAGG 
TGAGACAGAA CCCCAGGTTG GAMTTCCCTT GGGCCAACGA 
TGCMGGTGCT CTGTGGCCTG TAT r OTATTCT TTAAACAGTA 
CCACTTAGAA AGGAAACCTT GGTfGGGTGGY TTCCCTCGAT 
GGAATGTGTG TGCCACACCT G'flCCTTGTCC CAGGCCAGGA 
TGCACAGATA C AC GT ATGTC OTCGTGCATG ACCCCTGACT 
CCAAGCACCA GAGCAGACCC aHAAGAGAGGC CCGTGCAAGT 
GCTTCTGTTG CCTTGGGACy CATACACCGG CACACGTGTT 
AGCTTCGAAT TTTGCCCCGG ATTCTTCTGA TATTTCCCAT 
GGCCTGGAGG GCATTAGGAC ACATGGAATG AGTGGGGTCT 
TGGCAAGGCA GGATTAGAAA GACCAAGAGC AGGGTGGGGC 
CTCAGGCTCA AGACCCOGCC ACACACCCAC TCAAGCCTCA 
CCCAGGAGAG GAATGCCITGT CCTAGCAGCA CGTACATGGA 



CGGCAAGAAC 
TGTJCCCTGAC 
GQ 

4gtgagtgcc 
tgagcccctg 
gcgcctgggt 
ccagtgatcc 
cgttggggtc 
tgccccctca 
tcccaacact 
aactatgaaa 
ctctgcagag 
gccagggctg 
atgtcacata 
gtgccagctt 
gcaaaggcca 
gtgctttccc 
ctgtggcaca 
agttcctaag 
ccccatgtcc 
tcagcctctt 
tggcatcctc 
ccagcccctg 

GCCATGAAGC 
GAAGTGGTGT 
GCACCCCACA 



CAGCCAAGTA 60 

TCCTTTTTCG 120 

TAC ATGGGAG 180 

GTCTGGGGCC 240 

CCACTCATGG 3 00 

GCCATCCCCG 3 60 

AC ATCCCAGG 42 0 

CGCCGACCCC 480 

TCCTACCGGC 540 

GGCTGAAGCA 600 

ACCTCTCTCA 660 

AGAGGCTACA 72 0 

ARTCCAAAAG 780 

CCATGGACAM 840 

TAATGTCAGC 900 

TTTATTTATT 9 60 

CCACCTCCCT 102 0 

TGAGCTGGTG 1080 

TAGCCCTGCA 1140 

CC AGGTCCCT 12 00 

GACTTCCATG 12 60 

CAAAGCTCTG 132 0 

GGAAAGCC AC 13 80 

CTGTATGCCT 1440 

GTAGGGCAGC 1500 

TGTGCTCCAG 1560 



i 
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CCCTCTGGCT GTTTCTCTTG CTCTAGAATC AACTCCCTAC ATTGGGAATG' TAGCCATTTG 
GTAGAGGACT TGCCTAGCCT GCAGGAAGCT CACGTTCCAT CCCCTGCACC AAGGAGAATC 
AAAGCTCAGG AGGCTGAGGC AGGAGGATTG CTGTCAGTGG TGTACAGAGG TCATGGCCAT 
CCTGGGCTAT ATTAAACCTT GTCCTTTAAG AAAAAGAAAA GAAATCAACT TCCATTGAAT 
CTGAGTTCTG CTCATTTCTG CACAGGTACA ATAGATGACT TK^TTTGTTG AAAAATGKTT 
AATATATTTA CMTATATATA TATTTGTAAG AAGCATT 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 134 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



1620 
1680 
1740 
1800 
1860 
1897 



(xi) SEQUENCE DESCRIPTION: SEQ /ID NO:41: 

Gly Gly Trp Asp Leu Gly Arg Asn Arg Leu 
1 5 / 10 

Gin Pro Ser Lys Thr Tyr Pr© Ala Phe Leu 
20 / 25 



Tyr His Asp Gly Lys Asn 

15 

Glu Pro Asp Glu Thr Phe 
30 



lie Val Pro Asp Ser Phe Rhe Val Ala Leu Asp Met 
35 / 40 

Leu Ser Phe lie Val Asp/ Gly Gin Tyr Met Gly Val 
50 / 55 60 

Leu Lys Gly Lys Lys Leu Tyr Pro Val Val Ser Ala 
65 lb 75 

Cys Glu lie Arg Met Arg Tyr Leu Asn Gly Leu Asp 

85 / 90 

Pro Leu Met Asp Leu/ Cys Arg Arg Ser Val Arg Leu 
100 / 105 

Glu Arg Leu Gly ALa lie Pro Ala Leu Pro Leu Pro 
115 / 120 

Ala Tyr Leu Leu Tf/r Gin 
130 



Xaa Asp Gly Thr 
45 

Ala Phe Arg Gly 



Val Trp Gly His 
80 

Pro Glu Pro Leu 
95 

Ala Leu Gly Lys 
110 

Ala Ser Leu Lys 
125 
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(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 265 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
. (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
AAGGGTAAAA AACTGTATCC TGTAGTGAGT GCCGTCTGGG /GCCACTGTAG ATCCGAATGC 
GCTACTTGAA CGGACTCGAT CCCGAGACTG CCGCTCATGQ ATTTGTGCCG TCGCTCGGTG 
CGCCTGGCCC TGGGGAGGGA GCGCCTGGGG GAGAACCAG/a CCTGCCGCTG CCGGCTTCCC 
TCAAGGCCTA CCTCCTCTAC CAGTGACGTT CGCCATC^TA CCGCCAGCGC GACAGCCACC 
TGGTGCCAAC TCACTGAGCC GCCTG 



60 
120 
180 
240 
265 



(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 
I (A) LENGTH: 243 8 base pairij 

* (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 
AAGTGGCGGC GGTCCCTGGA GAGCAGGCGG RGGCAGCGGC 
CGTGGAGCCG GGGCGGGGGC TGACAGCCAG /GCCTCCGCCT 
CGGGAGTGGC CGGGCCTCTC TTCCGCGCTT/ GAGCGAGCGC 
GGCGGCAGGC GCTCGGACAG CTCCGCTTGA GCTGAGCTCG 
GCCCAGAAGA AACTTCCTCT TAGAAAAGOT GAAAAACACA 
TGTAAAGAAT TTGTTTAAAA TGGCTGAAAA CAATAGTAAA 
AACAAGTCGG AGTCGAAGTG CTGACAGG2 A GGATGGTTAT 
GTCTTGGTCC AAAAAGAGTG AGAGTTGT1 C TGAATCTGAA 



AAGTCTGACT 
GGCGGGAGCC 
CGGGTGATGG 
GAGAGATCCG 
RTATTTATAA 
AATGTAGATG 
GTGTGGAGTG 
GCC ATAGGTA ' 



CTGGGCTGAC 60 

GCACGAGGAG 12 0 

CGGTGGTGAT 18 0 

TCCAGAAAGT 240 

CACTGGAAAT 3 00 

TACGGCCTAA 3 60 

GAAAGAAGTT . 42 0 

CTGTTGAGAA 480 
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TGTTGAAATT 
CTTAGATCAT 
TGCGGTGGGG 
ATCTAAAAGA 
CTCAGATTTA 
CTCAGATGAA 
ACGAAGAAAC 
CATAACTGCC 
GGTCACAAAC 
GTGCACAAGC 
GTTGGAGGCA 
CCTCCTTCAG 
AGCTCTGCTG 
TTATTTATTC 
GTGGAATCAT 
TACTGGGCTC 
GTCCACTCCC 
TTGTAATTGT 
GTATCTGAAG 
GCAGCAGTGA 
TATTTTTCTT 
CCCTAAGTTT 
AAGCAGGTGT 
AATTTCAAGG 
TTTCCTATGT 
GTTACCTATT 
ATAATGTAAT 
ATGAAACTTA 



CCTCTAAGAA 
TCCTGTGGGC 
CAGTGTTTTC 
AAGATTCATA 
GCCTTTAGGT 
TGGGTGAGTG 
ACAGAAGATG 
AACAGTGCTT 
AACAGCATAG 
TCCAGAAAAA 
CCTCCTAAGT 
ATCAGTAACA 
GAAGGAAAGC 
TCTGTTAGTT 
AACTTTAGCT 
CTGGAACACT 
TTAATCCGGA 
AC GACTT ACG 
GAATACCATT 
TGCGGAGAGG 
ATGCCTCTTT 
TAATTCCAGA 
TTGGTTTTGT 
CATTGCATAT 
GTGAAATATT 
CTCTTTTCAT 
TGACTTTTGT 
ATGTATTTTC 



GCCAAGAAAG 
ATAGATTTTT 
CAATAAAGAA 
TCAGTGAACT 
GGCATTTTAT 
CAGACCTGTC 
ACATACCCTG 
CGTGTACAGG 
AAGACAGTGA 
GGAATAAGCC 
TCCACACCCA 
ATCCGTGCTA 
CAGAGGGCAC 
TTAGACGCTA 
TTGATGCCCA 
ATAAGGACCC 
CGTTCCCCTT 
ATGGCATCGJ 
ATAAATCi 
TTAGAATGfrC 
GAATTTT5TGT 
TCAATTTATT 
TTTTAGECATA 
ACATTCTGAAT 
TTGCTAATCT 
CTTQAAGATT 



AAT 



ATT TTAAATA 



'TGCCAA 



gcagcttagc 

aggccgatcc 

ttgtagtggc 

catgttagat 

taaacgacac 

tgagaggaaa 

tttctcaca' 

tggtcacata 

catggattca 

caggtgggaa 

gatcgm:tac 

CTGGGGTGTC 
CTTTTTACTT 
CAOTCGTTCT 
TGTATCCTTGT 
AGTGCCTGT 
'TCCTTGCAG 
TGCCCTTCCC 
AGTTAGGTTA 
GACCTGCATA 
ACAAAGGCAG 
TTTTTTATGA 
TAAATTTACA 
ATTCTGTATT 
ATGCTATCAG 
TTCAGTAAAG 
TAGGAGTGTT 
TTAACTAAAC 



tgttcctcca 
cttaa&caga 
cgaccactctc 
jtgccctt 
:tgttccta 
/ctgagagatg 
accaatggcc 
actggttcta 
gaggatgaaa 
atggaagagg 
gtccactgcc 
atggacaaat 
c g agattc ag 
cttcatgcta 

GTCTTCCATT 
ATGTTCTTTG 
CATATTTGCA 
ATTCCTTCGC 
CTCAGGATTG 
CATATTTTCA 
TTGAATCAAA 
TACACTTGTT 
T ATGGTC C AG 
TTTTAAATAA 
TATTCTTGTA 
AGTGTTGTAA 
AAACAACAAA 
CAAGTTTGTT 



TTGAGTTGGA 540 

AACTGCAAGA 600 

CAGGGCTTCC 660 

TCCCACCTCG 720 

TGAGTCCCAA 780 

CTCAGCTGAA 840 

AGCCTTGTGT 900 

TGATGAACTT 9 60 

TTATAACGCT 102 0 

AGATCCTGCA 1080 

TTGTTCCAGA 1140 

ATGCAGCCGA 12 00 

CGC AGGAAGA 12 60 

GAATTGAGC A 132 0 

CTCCTGATAT 13 80 

AGCCGCTCTT 1440 

GAACGGTTAT 1500 

CTATGAAATT 1560 

ATGTGCC AGA 162 0 

TTTAATATTT 1680 

TAAAACTGTG 1740 

ATATATTTTT 1800 

GC ATATTTAC 1 8'6 0 

TCTTTTGTTC 192 0 

TGACCGAATA 198 0 

TC AATCC ATT 2 040 

ATGATTTAAA 2100 

TGTTAGTTAT 2160 
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TCTAGCCAAT AAGAAAAGAG AATGTAGCAT CCTAGAGGTG TATTTSTTCT GCAGTTTGGC 

AGGACCGTCA GTTAGTCCAA ATAAACATCC CCTCAGCGTG GAGGCGAATG GAACCTGTGC 

TCCTTTCTTA CGGGAAGCTT TGCAAAGCAA AATAGCAGGG TTACAAGCTT GGAGTTGTTA 

AGGCAACTAG AGTTTTCTCT ATTAATTTAT AGACTGTTGT TJGC AC CT AC T TAGCTCTTTT 
TTGGGAACTC TAGTTCCCAG GGGAAAATAC CTCGTGCC 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 542 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



2220 
2280 
2340 
2400 
2438 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Ser Gly Gly Gly Pro Trp Arg Ala/ Gly Gly 
1 5 / 10 

Ser Gly Leu Thr Val Glu Pro Gly Arg Gly 
20 / 25 

Pro Gly Gly Ser Arg Thr Arg />er Gly Ser 
35 

Arg Leu Ser Glu Arg Arg Va]/ Met Ala Val 
50 55 

Arg Thr Ala Pro Leu Glu L^u Ser Ser Glu 

65 70 

Pro Arg Arg Asn Phe Leu Aeu Glu Lys Leu 

85 / 90 

Thr Leu Glu lie Val Lys /Asn Leu Phe Lys 
100 / 105 

Lys Asn Val Asp Val Arg Pro Lys Thr Ser 
115 7 120 

Arg Lys Asp Gly Tyr Vafl. Trp Ser Gly Lys 
130 / 135 

Lys Ser Glu Ser Cys Ser Glu Ser Glu Ala 
145 150 



Gly Ser Gly Lys 



Ser Asp 
15 



Leu Thr Ala Arg Pro Pro 
30 

Gly Arg Ala Ser Leu Pro 
45 

Val Met Ala Ala Gly Ala 
60 



Arg Ser Val Gin 
75 

Lys Asn Thr Xaa 



Lys Val 
80 

Phe lie 
95 



Met Ala Glu Asn Asn Ser 
110 



Arg Ser Arg Ser 
125 

Lys Leu Ser Trp 
140 

lie Gly Thr Val 
155 



Ala Asp 
Ser Lys 



Glu Asn 
160 



Val Glu lie 
lie Glu Leu 



Ser Leu Lys 
195 

Lys Asn Cys 
210 

lie His lie 
225 

Ser Asp Leu 



Pro Leu Arg Ser 
165 

Asp Leu Asp His 
180 

Gin Lys Leu Gin 



Met Ser Pro 

Lys Leu Arg 
275 

Pro Cys Phe 
290 

Ser Ala Ser 
305 

Val Thr Asn 

lie lie Thr 

Glu Met Glu 
355 

Thr Gin lie 
370 

Ser Asn Asn 
385 

Ala Leu Leu 
Ala Gin Glu 



Ser Leu His 
435 



Ser Gly Arg His 
215 

Ser Glu Leu Met 
230 

Ala Phe Arg Trp 
245 

Asn Ser Asp Glu 
260 

Asp Ala Gin Leu 



Ser His Thr Asn 
295 

Cys Thr Gly Gly 
310 

Asn Ser lie Glu 
325 



Leu Cys Thr S 
340 

Glu Glu lie L 



Asp Tyr Val His 
175 

Pro Cys Tyr /Trp 
390 

Glu Gly Ly^ Pro 
405 



Asp Tyr Le 
420 

Ala Arg II 




L^Q Ser Cys Ser 
175 

g Phe Leu Gly 
190 

Gin Cys Phe Pro 
205 

Pro Ser Lys Arg 
220 

Pro Phe Pro Pro 



Arg His Thr Val 
255 

Asp Leu Ser Glu 
270 

Thr Glu Asp Asp 
285 



Met Asp 
395 

Thr Phe 
410 



Val lie Thr Ala 
300 

Ser Met Met Asn 



Asp Ser Glu Asp 
335 

Asn Lys Pro Arg 
350 

Pro Pro Lys Phe 
365 

Asp Leu Leu Gin 
380 

Lys Tyr Ala Ala 



Leu Leu Arg Asp 
415 



Ser Val 
425 

Gin Trp 
440 



Ser Phe Arg Arg Tyr Ser 

430 



Asn His 



Asn Phe Ser Phe 
445 



Ser 
Arg 
He 
Lys 



Arg 
240 

Pro 



Arg 
He 
Asn 



Leu 
320 

Glu 



Trp 



His 



He 



Glu 
400 

Ser 



Arg 
Asp 
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Ala His Asp Pro Cys Val Phe His Ser Pro Asp lie Thr Gly Leu Leu 
450 455 460 

Glu His Tyr Lys Asp Pro Ser Ala Cys Met Phe Plie Glu Pro Leu Leu 
465 470 475 7 480 

Ser Thr Pro Leu lie Arg Thr Phe Pro Phe Ser yLeu Gin His lie Cys 

485 490 / 495 

Arg Thr Val lie Cys Asn Cys Thr Thr Tyr Asp Gly lie Asp Ala Leu 
500 505 J 510 

Pro lie Pro Ser Pro Met Lys Leu Tyr Leu Lys Glu Tyr His Tyr Lys 
515 520 / 525 

Ser Lys Val Arg Leu Leu Arg lie Asp Val' Pro Glu Gin Gin 
530 535 / 540 



Q 



(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4999 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45 



CCCTCTGGGC 


AAGCCGCCCC 


CCCCCCACCC 


ATCTACCACA 


CACACACACA 


CACACACACA 


60 


CACACATTCA 


GACCTTGGGG 


CAAAAACAAA 


GCAAAATAAC 


AACAACAAAA 


ACACTGCCTG 


120 


TGGAAAGTCC 


TTACTTCAGG 


AAGGTTGGCA 


GATGAGGAGC 


AAGGGAACAT 


TTTATCAGGA 


180 


CTGCCACAAA 


GGAGTCTTTT 


TTTTTAATGG 


^TTTCAAGA 


CAGGGTTTCT 


CTGTATAGCC 


240 


CTGGCTGTCC 


TGGAGCTCAC 


TTTGTAGACC 


\GGCTGGCCT 


CGAACTCAGA 


AATTCGCCTG 


300 


CCTCTGCCTC 


CTGAGTGCTG 


GGATTAAAGG 


CGTGCAGCAC 


CATGTCCAAC 


TGGCATTTTC 


360 


TCAATTAAGG 


TTCGTTCCTT 


TCAGATAACT 


CTAGGTTCTG 


GGTCAAGCTG 


ACACAAGGCT 


420 


ACACAGCACA 


GTTTGTATGC 


CACATTCAGT 


TCAGAAGACA 


CCCAACCTCC 


CTGGAACTGG 


480 


AACTTATGCA 


CATTTGTGAG 


CTTCCACTTG 


GGAGTGGGAA 


CCTGAACTGG 


GTCCTCTGCA 


540 


AGAGCAGCCG 


TGCTCTTAAC 


TGCTGAGCCA 


TTTCAGCAGC 


CTCACATCAG 


AATTAAGTTA 


600 


GAAATTAGCCG GGTATGAATC ATACCCTTAG 


AATCCTAGCA 


TCTGAAAGCA 


GAGCTAAGAG 


660 
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AAACAGGGAT 
AAGAGACTAT 
AGGCTGGGCA 
CAGTGTGAGT 
GTCTCTCTCT 
CACACACACA 
GCCCTGGCAT 
TCTGCCTCCC 
TTTTTGAACT 
GATAATGTAG 
AGACAGCGTG 
TAAGGCTTCA 
GAACACAGGA 
GGAATGACTA 
ACACACACAC 
TTAATTCTAG 
AAATAAACTG 
TCAAAGAGTG 
TTTGCCTGGA 
GTCTGGGAGG 
CGTCCGCAGC 
TATCTGTACT 
CCCGGCGTTA 
TGAGAAACTT 
TTAGGGGACC 
AGGGAAAGAT 
GGAGCTCAAC 
GCTCTCCTCA 



TCAAGACCAG CTCTTGGCTA 
TTCAAAGCCA TCCAAACAAC 
CAGGGTACAC ACCTTTAATG 
TTGAGTTCAA CGTGGTCTAC 
CTCTCTCTCT GTCTCTCTCT 
CACACACGGT GGCATTATGG 
AGATTCACTC TGTAGACTAG 
AAGTGCTGGG ATTATAGGTG 
GTTATCAAGA GGCTTTCGAG 
CTAATGATCA AACGACACTC 
CCCACTCGTA GTTCCATTAC 
GCAACCTAGG GAGCCGCAGG 
GCAGGAGTTC AGGAAGGGTG 
GCTCAGGCAG AGAGAGCAAA 
ACACACACAC ACACACACAC 
TCTGGGATGG GGGGGAGGGT 
CTGAAGAGTC TGACGCCAGG 
TGCTCCACAA AGCATGCGCG 
TTCTTTGTAG CCGGTGGGTT 
TGACGATAGG GTTAATCGTC 
CCCGCTGGAG CCGGAAGCAG 
TC C AC AG AGG TCTCTGCGAG 
GAGCCAGCAA GGGGACGGTT 
GGGGGGCGCG ACACAGATAGJ 
AAGGGAAGAC CAGGCTGGTTj 
GATGCGCCTA ACAGTCCTTT 
TTTCAAAAGC GAGACGCCCC 
TGGGCCAGAC GGCCCTGGCA 



CAGAGCCCGT CCTGTCCTAG 
AATAACTACA ACAACAACAA 
CCAACACTCA GGAGGCAGAG 
ATAGGGAGTT CTAGGCCAGC 
CTCTCACACA CACACACACA 
GATTTTTTTG GGAT^\GGTT 
GCTAGCCTTG AACTCAGAGA 
TTGCACCACC ACTGCCCAGC 
GAGGTCAAAC T2TCAACAGCA 
AAAACTTAAC CCTTAAAGCA 
TCAGGAGGCT/ GAAGCAGGAG 
GGACAGTAGT CTCAATCCCT 
TCAAGGCGGC TTACTGATCT 
GGTCTCOAGT GGAGAAGTCT 
AGAATGCAAG GCGATGACGT 
GGGGGACGCA GCTGTCAGGT 
GAGTJCCTGGG AGGGACAAGA 
CTTGTCCACG TCTGGAGTCG 
CTCAAGGCGG TAAGTGGTGT 
CACAGAGCCC AGGGGCGGAG 
TGGCTGGTCA GGGGCGCTTC 
CTAGGGGGAC AGTGAGGTGC 
fCACGGTAAGG TCTGAGGGAG 
GGTGAAAGCA GAGTGATAGA 
GGCATACACC GGTGAACGGA 
CTGTCTCCAC ACCACTCCAG 
AGCAAGCCTG TTTTGAGAAG 
AGGGGCAGCA GCAGCACCCC 



GATGGGCTAC 72 0 

'GGTTAAAATT 780 

GCAGGCTGAT 840 

AGAGGTTACA 9 00 

CACACACACA 9 60 

TCTCTGTCTA 102 0 

TCCGCCTGCC 1080 

CACTTTGGGA 1140 

ACCTCTCCAT 12 00 

CACATCCACC 12 60 

GATGAAGGAC 132 0 

ACATTCTCCT 13 80 

TAGGGCCTC A 1440 

ACACACACAC 1500 

C ATC AAAGGG 1560 

GGCTTTGGAA 162 0 

GGTTACCCAC 1680 

TC ACTTATTT 1740 

GGCCGCCGTG 1800 

CGCGGGCGGG 1860 

TAGCCTTCCC 192 0 

GGGGTAGGGG 1980 

AGAGAGCTCC 2 040 

CCTGGGATGG 2100 

TGGGAGTCCT 2160 

GGGACGATCC 222 0 

TTCTTCAGCG 22 80 

T AC C TC GC AG 2340 



GCTCTGTACT 
CCTGACCTGG 
GATGTCAAGG 
GTCCGGGGGA 
GAGCAAAGGG 
GACCACTATG 
GGAAAATTGT 
GGTGAGCAGC 
CTTGGCTACT . 
ACCCTCTATC 
GGCGAAAGAA 
TTGGGCTGGA 
ACTTAGATGG 
Q j CAGAGCCTTT 
GCCTGAGCAG 
CTCTCTCCGC 
ATGCTCTGGG 
AGCGCTATCT 
GACAGGTGGA 
GGGGGGCTGG 
AAACACCATG 
CAACCTGTTT 
CCATCACTGT 
CTTACATGTA 
CTATCCCAGG 
CTGCATCAGA 
AGGCCAAAGT 
AATGGAAGGT 



CGGACTTCTC 
TTGCCCAACG 
AAGGGGGTCT 
AACGGGGCTA 
GCACACACGC 
CGGCGCTTTT 
ATCATCAGAG 
T AGTGGTGC C 
CTATTGGGGG 
CCTCTGTAAG 
GAGGTGAGAT 
AACTCATGGT 
CCTTGGATCT 
GGGTCTCCCT 
GATCTCAAGT 
AGTGGAGGAA 
GGACACCCGG 
GCTCTACAAA 
GAGGCACCCG 
ACCCCTTCAC 
GCAGCCTGGG 
CATTATTGTT 
CTTAAGGAAT 
GGATGGTTCA 
CCTCTTAGGG 
CATCCAGTAG 
GACACGAAGC 
GATTTCACTT 



TCCTCCCGAG 
GCACCACGGC 
GTGCTTTGAG 
TTCGAGAGGT 
CGTGGTGGGC 
GGGCAGCAAC 
TAAGGGCCTC 
AGAGAGACTG 
CACGTACCTG 
TGCTGTTTGG 
ACGGACTAGG 
TGGAGCACAG 
AGCTTCACTC 
CAGCTGAGGT 
TCAAGGATGC 
CCACAATCCC 
CTGGGTCAAA 
TGACCCAGTA 
CTGGCCTAGA 
CTCCCCTTCT 
ACAAAGAGGT 
TTTTGTTTTG 
TATGACAACC 
CAAACACAAT 
TCTCATGTAT 
AACTTGGGAG 
CCACTTCCTG 
GTCAGGGCCC 



GAAGTAGGtf 
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GGCTTGGAGG 

TGGAACCCCA 

CGGCGCCCTG 

CTGCACGCCT 

GTGGCCACCG 

AGCGAGTCCT 

GAGGCCCCCC 

CTGGTGGTTC 

GGACCAGCCT 

GGCCAGTGCCj 

TGTGGGGAGJ 
T 

CCAATCCCTA 
GGCGGTGGAA 
CTGGAffiTTGC 
TTCTC&CACCT 
TATGfCACTCT 
GTACAGGGTG 
CAACTTTAAA 
CACAGGAGCA 
TfTTGAAGTA 
TTACACTC 
(tACAAAGCTC 
2 AGGGGCTT 
ACCGAATTCA 
TGAAGCTAGA 
GCTCCAACC 
.GGGACAG 



GGGAGA5 



7CTGGGA 



PATCCAGC 



AGCTCCTGTC 
AGGATTGCTC 
TGGCCCAGAG 
'CAG 

CCCTQGCCCC 
GGGC 
AGJ 

1GACATGGA 
'CCGTGGACT 
AGGTCCGCAT 
TCACTACTCT 
TCTTGTCACT 
TTGGATGTGA 
ATGGAGGAAG 
TTACTTACCT 
GAGCCGCCTG 
GCCTTTGCCC 
TGCTGGCACC 
AAGC TGGTG A 
AGACATATAG 
AAAAATGAGA 
CCCCACCCCA 
AGGCCCAGGT 
TGGCACCGTG 
GACCCGAAAG 
GCCAAGGCCA 
ATGAGTTTCC 
TCAGTTCTAC 



'GCTCCCCCT 2400 

CGAGAACATC 2460 

C ACTGATGGA 2520 

CTGGCCCCTG 2 580 

GCTGCAGGCT 2 640 

TATTGGGCGG 2700 

TGGACCTCAG 27 60 

GGAGGGGACT 282 0 

GAAGGGGAGG 2 880 

CCGCTACATG 2940 

TGGCAATGGT 3 000 

TTGGCCTGTC 3 060 

TGC ACAAATT 312 0 

AAGGAAGGGT 3180 

TGTCTTCCTT 3240 

TGTGTGCGCC 33 00 

CCTGCCATGA 33 60 

CTACCGTGGG 3420 

AGCTGGGGGG 3 480 

AAATGATATT 3 540 

TGTATTGTC A 3 600 

GGCTAGAGCC 3 660 

GTTTATTTCC 372 0 

GGGGAGGGGA 3780 

CTCTGAATTT 3 840 

TCTAAGTGAC 3900 

AGCCCAAACC 39 60 

TCCCTCCCCT 402 0 



CACTAGGAGC 


CACCTTGGTG 


ACAGTTGATT 


CTACCCACTG 


TAAGTGGT^A 


AGGGATTGGC 


4080 


CTGGTCCCAA 


CCATAATAGG 


GCGGTGGAAA 


CGGCTCAGGA 


GGGTACAGCG 


TGGATTAGGC 


4140 


CACAAGATGG 


GGCAGATGAT 


GTCATCAGAA 


GCATGTGACC 


GGTGG(SAGCA 


GTTACTAAAC 


4200 


TTCTGGGCAA 


CCTAGTCCAT 


GCTATGCAGG 


CAGGTAGAGG 


GATQGGCAGT 


GCTCATTGTT 


4260 


TGGCATTGAT 


GATGTCCACA 


AATTCAGGCT 


TGAGAGATGC 


GCJCACCCACA 


AGGAAGCCGT 


4320 


CCACGTCAGG 


CTGGCTTGCC 


AGCTCTTTGC 


AGGTTGCTCC 


AGTCACAGAA 


CCTGTACCAG 


4380 


GAACAAGAAG 


ACAGTTTGGT 


CAGGTCTATG 


ATCAGAACACy 


/ttaagcccca 


CCTCTCTGTG 


4440 


CAAGGCAGCC 


TCAGTCTGTC 


TTAGCCCATT 


TCCGTCTTAjS 


CTAGAGCCAA 


AGCCACTCAC 


4500 


CTCCATAAAT 


GATCCGGGTG 


CTCTGAGCCA 


CCCCATCATT 


GACATTGGAT 


TTCAGCCATC 


4560 


CCCGGAGCTT 


CTCGTGTACT 


TCCTGTGCCT 


AGAAGGM5GA 


GGCAGAGCTA 


CTAAGTAAGC 


4620 


TCCTTCCTAT 


CTATCATTCA 


AGGAGTAAAA 


ACCACTGGTT 


CTCACATAGA 


GTTGAGTTTC 


4680 


CAGAAAAGCC 


CCGGGACCAG 


AGAGTGGCAA 


GGCTGCAATC 


CCACCAGGCT 


TGGAATGAAC 


4740 


ATTTTTGGCA 


AAGTCACTCT 


CCTTGGTGAG 


TTTffiGGGGCC 


CTCTGTCTCT 


AAAGGGGCTT 


4800 


GGATGGGCTC 


CATAGCTGTG 


TGAGTCTGTT 


AAAGCCGGAC 


AGGCTGAGGA 


GCTCTGGGTA 


4860 


GTTACCTGCT 


GAGGGGTTGC 


CGTCTTGCCA 


GTCCCAATGG 


CCCACACAGG 


TTCATAGGCC 


4920 


AGGACCACCT 


TGCTCCAGTC 


TTTCACATTA 


TCTGTGGGGC 


AGAGAGGAGA 


GTGAGTAGGA 


4980 


AGGAGCTGAC 


CCGCCAAGC 










4999 



(2) INFORMATION FOR SEQ ID NO : 4/5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 64 amirio acids 

(B) TYPE: amino acifl 

( C ) STRANDEDNESS : jingle 

( D ) TOPOLOGY : 1 ineir 

(ii) MOLECULE TYPE: DN A / 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 

Met Gly Gin Thr Ala Leu Ala Arg Gly Ser Ser Ser Thr Pro Thr Ser 
1 5 [ 10 15 

Gin Ala Leu Tyr Ser Asp Phe Ser Pro Pro Glu Gly Leu Glu Glu Leu 
20 25 30 

Leu Ser Ala Pro Pro Pro Asp Leu Val Ala Gin Arg His His Gly Trp 
35 40 45 



Asn Pro Lys Asp 
50 

Cys Phe Glu Arg 
65 

Lys Arg Gly Tyr 



Leu Glu Gin Arg 
100 

Ala Pro Leu Gin 
115 

Glu Ser Trp Gly 
130 

Lys Gly Leu Glu 
145 

Leu Val Val Pro 



Thr Leu Gly Tyr 
180 

Gly Leu Lys Gly 
195 

Gin Cys Gin Val 
210 

Pro Gin Ser Leu 
225 

Gly Asp Thr Arg 



Met Lys Arg Tyr 
260 



Cys Ser Glu 
55 

Arg Pro Val 
70 

Ser Arg Gly 
85 

Gly Thr His 
Ala Asp His 



Trp Asp lie 
135 

Ala Pro Gin 
150 
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Asn lie Asp Val 



Ala Gin 



Leu His 



Ala Val 
105 

Tyr Ala 
120 



Ser Thr 
75 

Ala T; 
90 



Lys/ Glu Gly Gly 
6( 

^sp Gly Val Arg 



Glu lie Ser Trp 
95 

Val Ala Thr Ala 
110 

Leu Gly Ser Asn 
125 

Leu Tyr His Gin 
140 

Pro Gin Gly Glu 



INFORMATION FOR SEQ ID NO: 47: 
(i) SEQUENCE CHARACTERISTICS 



(A) LENGTH: 5 6 

(B) TYPE: nucl 



(C) STRANDEDNESS: single 



(D) TOPOLOGY: 
(ii) MOLECULE TYPE: 



linear 



DNA 




Glu Arg Leu 
165 

Ser lie Gly 
Arg Thr Leu 



Arg lie A:og 
215 

Leu His /.eu 
230 

Leu Gl^J Gin 
245 

Leu L^u Tyr 



Leu Cys 
235 

Thr Leu 
250 



Asp Met Glu Glu 
175 

Gly Pro Ala Phe 
190 

Ser Ala Val Trp 
205 

Arg Arg Val Glu 
220 

Val Arg His Ala 



Pro Leu Pro Pro 
255 



Leu 



Gly 
80 

Pro 



Leu 

Ser 

Ser 

Gin 
160 

Gly 
Arg 
Gly 
Glu 



Leu 
240 

Ala 



5 base pairs 
sic acid 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47 
GTACTTTCTT TATATCTCCA TAATTTTATT TACTATTACT 
AAAGTCTTTG TAACCTCCTT AAGGATTCAC TGCTTAATCTV 
TTAAATGCGA ACCAGAAACT CTTCCAAATG TGTTACATCJ 
CTACCAACCC CATGCAATAG ATACTAATGT GATCTCTGTC 
ACAGGGAGGT TCAGTAATTT GCCCAAGGTC ATACACAQAC 
CCGGGGAGTC TGGTCCCACA GCTGGCATGT TTGCCATOAT 
TGTCGGCACT CATTAAGCAC ATTGACAGCT ATGCTTGGTG 
CTCTGTGCTA CATGCTTTAC CTGGATTATT TCAACTGCAC 
ACCATCATTG CTCCTATTTT ACATAACAGA AAACTJACAGA 
GGCTCATGCC TGAAATCCCA GCACTTTGGG AGACCCTGTC 
GGCCGGACGT GGTGGCTCAC ACCTGTAATC TCAGCACTTT 
TCACAAGGTC AGGAGTTCTA GACCAGCCTG GCC^ACATGG 
AAATACAAAA AATAGCTAGG CGTGGTGGCA GGTGCCTGTA 
TGAGGCAGGA GAATCCCCTG AACCTGGGAG ATpGAGGTTA 
GCTGCACTCC AGCCTGGGCA ACAAGAGCAA GACTCTGTCT 
ATAAAAATAT TTTTTTAAAA ATTAGCTGGG TGTGGTAGCA 
CTTGGGAGGC TGAGGTAGGA GGATCACTTG AGCCCAGGAG 
TGATGGCGCC ACTGCACTCT AGCCTTGGTG P< C AGC AAGAC 
AAGAGAAATC GGGCAACTTC CCCAAGATCG C GCAGTTAAC 
AACTCGAAGT CTTAATCAGG ACACTCTACC P. AATGAGATC 
GCATCCAGTA TGAAGACTGG ACCAGCAGGG AGAACTATGA 
AAGCAGATTT CACAGCCTCA GAGGTGGCAC AGGCTGACTC 
ACCAGCCCAG AAACAGTGAC CCAGAATCAC AGGGAAGTAG 
AAGCCCCTCC TTGACCCCAT GCTCCTTACC CTCAGGGGCG 
GGCTCAAAGG TCTTGACGGT GGAGAACACC ATGCCCAGGG 
ATCAAAGCGT TAATTCTGAG ATGGGCCTGC CCGGGTGCGG 
GGGTTAACTG CCCCGGGCCT TCGCCGTGGG GGCGGGGCCT 



.CATGATACA 
CCAGTGCTTA 
ATAACCTCAT 
TTACAGAGGA 
TGGCCTTCAG 
ATTATATTGC 
AGTGACTACT 
AACAACCCTG 
AATCTGGGGC 
TCTAAAAAAA 
GGGAGGCTAA 
CAAAACCCTG 
ATCCCAGCTA 
CAGAGAGCCG 
CGAAAAAAAT 
CATGCCTGTA 
GTCAAGGCTG 
CCTGTCTCAA 
TAGTGGCATA 
AACGGCTCAG 
TGCGTACAGC 
ACAACCCGGG 
AAATGGGATT 
CAGGAGTTAG 
ATTCCCGACG 
ACTCTGCCGC 
CGGGGAGGGT 



TTATTTTATA 6 0 

GCACAAATCA 12 0 

TGGATTCTCA 180 

AGAAACAGGC 240 

GTATTCATGC 3 00 

CTCCTTATAG 3 60 

ATGTACCCAG 42 0 

TGAGGTAACT 480 

TGGGCGTAGT 540 

ATTTTTTTTT 600 

GGCAGGCAGA 66 0. 

TGTCTACTAA 72 0 

CTCAGGAGGC 780 

AGATCGTGCC 840 

AAAAATAAAA 900 

GTCCCAGCTA 960 

C AGTGGGCTG 102 0 

AAAAAAAAAA 1080 

GCTTCACTCA 1140 

TAATGGATTG 1200 

CTAGAGCCTG 12 60 

GC AGAAAGGG 132 0 

CGGC AC AATG 1380 

TCGCTCAGGC 1440 

CGGTGATGCC 1500 

AGC AAGAGAA 1560 

CACAGCCCGG 162 0 
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GACTGAGACC CGAGGTTAAC CGCCCGGGGT GGGCTCCACG GGGGGGGGGC ATGCTCTCCG 1680 

CGGCTGCTGC CGGTATAGAG CGGTAACTGC CCAGGAGGGG GCGC^GGCCCC ACAGGGGCGT 174 0 

GGCCTCGGAG CTGCACGGCC GTGGGCGGCG ATGAGAGGGT TAAGCCCCAG AGGGCCCTGG 1800 

AGGGGC GGGG CCGCGGGACG GGCTCGGCCC AAGGGAGGAG (CTGGGGGCGG AAGCGGCCGG I860 

CGGTCTGCGC CCTGCGCGCC TCGGCTTCTT TCCGCCCGGc/tCCTTCAGAG GCCCGGCGAC 192 0 

CTCCAGGGCT GGGAAGTCAA CCGAGGTTCG GGGGCAGC^G CGAGGGCTCC GGGCGAGTAA 1980 

GGGGGATGGT CCATGCTGAG GCCCAAATGG GGCGAACTCG CGAGAGTCTC TGGCGACCTG 2 04 0 

GATCAGATGG GGCGAGGGCA GATGAAGGGC CCAGGAGCTT TGGGGCAGCG AGGAGGGAGG 2100 

AGCGGGCCCG TTGGCAAACT TGGGTGAAAG GATGGGGTAC CTGGGTGACG AGCCCCCGCC 2160 

AGGATTCTGC TCTTCACGCC CCTTTTCTCC CAGCTCCCTT CCAGGTCAAT CCAAACTGGA 222 0 

GCTCAACTTT CAGAAGAGAA AGACGCCCCA GCAAGCCTCT TTCGGGGAGT CCTCTAGCTC 22 80 

CTCACCTCCA TGGGCCAGAC AGCTCTGGCA GGGGGCAGCA GCAGCACCCC CACGCCACAG 2 340 

GCCCTGTACC CTGACCTCTC CTGTCCCGAG GffiCTTGGAAG AGCTGCTGTC TGCACCCCCT 2 4 00 

CCTGACCTGG GGGCCCAGCG GCGCCACGGT TGGAACCCCA AAGACTGTTC AGAGAACATC 2460 

Ca V GAGGTCAAGG AAGGAGGGTT GTACTTTGAG /CGGCGGCCCG TGGCCCAGAG CACTGATGGG 2 52 0 

GCCCGGGGTA AGAGGGGCTA TTCAAGGGGCJ CTGCACGCCT GGGAGATCAG CTGGCCCCTA 2 580 

GAGCAGAGGG GCACGCATGC CGTGGTGGGC GTGGCCACGG CCCTCGCCCC GCTGCAGACT 2 64 0 

GACCACTACG CGGCGCTGCT GGGCAGCAAC AGCGAGTCGT GGGGCTGGGA CATCGGGCGG 2700 

GGGAAGCTGT ACCATCAGAG CAAGGGGGCC GGAGCCCCCC AGTATCCAGC GGGAACTCAG 2760 

GGTGAGCAGC TGGAGGTGCC AGAGAGACTG CTGGTGGTTC TGGACATGGA GGAGGGAACT 2 82 0 

CTGGGCTACG CTATTGGGGG CACCTAuCTG GGGCCAGCAT TCCGCGGACT GAAGGGCAGG 2 880 

ACCCTCTATC CGGCAGTAAG CGCTGTCTGG GGCCAGTGC.C AGGTCCGCAT CCGCTACCTG 2 940 

GGCGAAAGGA GAGGTGAGGC CTGGGGCAGA CGTGGGGAGA ACTTTCTGTC CCTGGTGGCA 3 000 

GTGGTTTGGG ATGGAAACTC TTCTGACAAG AGCAGAGGGG ATGGACCTTC ATCCAGCCTG 3 0 60 

CCTCAACCTC TGTTCAGTGC TGGGA kAGGC TAGGGGTCTT CACAGCTGTT ATTTAATTTA 312 0 

ACCCAACAGC AATAGAGGTG AAAC A SGCTT GAGAAAGCAA CTTTCTCAAG TTCTCTTGGC 3180 

CAGTAAATGG TGAACCTTCA GAATG3AGGG AGGAACTGCA GGGATGAGAG AATTCAGGAG 32 40 

ATATCAACCC CTGAGCAAGA GGTGCAAAGC GTTAGGTACT GGGTTTGATG TACAGGTCCA 3 3 00 



- 170- 



AAAGAAGGAT GGGCAGAGCC AGGTACCCAG GCTGTATACC GGAOTCCCTG GGCTCTAACC 33 60 

TGTCTCTGTG CCACATACCT ACTTCCTTCC TCAGCCACAC CTOTGGATGG AGACACTGGG 3 42 0 

GCCCTGGGCA CCAGGGAGGA GAGCAGTGGA GGAGGCAGGG OCTTAGGGTG GGGCAGCAGG 3 48 0 

GGAGGAGCCT CCCCAGGAAC TGACTGGGTC CAGGGCTTGg/aGCTGCTCTC TGCAGTTGTG 3 54 0 

TGGGCTGTAG AGTGGAGGGC CATCCCTCCT CACCTCAGQx: CCAGCTCCCA AGCCTCTGGA 3 600 

GTCAAAGCCT GGGCCAGCTC CACCACTGTC AGAGCCAfCCT TGGCCTGTTG TTTAGAGGGC 3 66 0 

CTTAGCCAGC TCTTCACCCC CAGCTCTGAC TAGGGAO?GTG TGAAATCTTA TCTGGGAGGC 3 72 0 

AGAACTTCCG GGTATCTCAA ATTCCCCTTT CAGCG/aGGTG GGCACACTCG AAGCAGGAAA 3780 

GCAGAAAGGC ATCTGAGTAG GACCCCGTAG TTTGAGGACA TCTGGCTGGT GGCTGCACCC 3 840 

ATACTTACAT TCCCCTCCTT CTCTCTCCCA GCGGAGCCAC ACTCCCTTCT GCACCTGAGC 3 9 00 

CGCCTGTGTG TGCGCCACAA CCTGGGGGAT AGTCCGGCTCG GCCAGGTGTC TGCCCTGCCC 3960 

TTGCCCCCTG CCATGAAGCG CTACCTGCTC TCVCCAGTGAG CCCTGTGATA CCACAGACTG 4 02 0 

TGCTGAGGTC TTGCCACCAC CCCTCCCCTT JGGGGAGGTGG GGAGGCACTG CTGGCCTAGA 4 080 

CCAGCTGCTG AAAGCTGGTG AGGCTGAGCQ CCTACCCCAA CCCAAGCTCT GCGGAAATCA 414 0 

ACAGCCCCAG AGCCACTTGG AGGGAGGAAG AAAGGGAGCC GGCGTTCAAG GCTATGACAG 42 00 

TCTGCTACGC AAAACATTTT TTCAAGTAAA AATAGTAAGA GATGTTGTTA TAGAAACCTG 42 6 0 

TTCTTGTTTT TTTTTTTTTC TTGCACAAAT GATCATTTAT ATAGCTGCCT CAAAAAGGAA 432 0 

GATTATCTGG GCAAGTCCAG TGAAGGQAGA CAAACCACAA GACCTAGTGC CAGGTTTATT 43 80 

CCCTCACATG GGTGGTTCAC ATACACAGCA CAGAGGCACG GGCACCATGG GAGAGGGCAG 4440 

CACTCCTGCC TTCTGAGGGG ATCTTGGCCT CACGGTGTAA GAAGGGAGAG GATGGTTTCT 4500 

CTTCTGCCCT CACTAGGGCC TAGGGAACCC AGGAGCAAAT CCCACCACGC CTTCCATCTC 45 60 

TCAGCCAAGG AGAAGCCACC TTGGTGACGT TTAGTTCCAA CCATTATAGT AAGTGGAGAA 4 62 0 

GGGATTGGCC TGGTCCCAAC CATTACAGGG TGAAGATATA AACAGTAAAG GAAGATACAG, 4680 

TTTGGATGAG GCCACAGGAA GGAGCAGATG ACACCATCAG AAGCATATGC AGGGAAAGGG 474 0 

CAGTTACTGG GCTTCTGGGC TGCTTAGTCC CTGGCTTGGC AGGAAGGGTA GGGAAGATGG 4800 

ATGGGGCTCA TTGTTTGGCA TTGfATGATGT CCACGAATTC GGGCTTGAGG GAAGCACCAC 4860 

CCACAAGGAA GCCATCCACA TCAGGCTGGC TGGCCAGCTC CTTGCAGGTT GCCCCAGTCA 492 0 

CAGAGCCTGG GAAGGGAGCA GAACAAGGGC TTGGTCAAGA ATGGGATGAG TCTGCCCCAT 4980 



CCCCACCTCC 
CCAAAGCCAC 
GGACTTCAGC 
CTAAGTAAGA 
GTGTGGTTTC 
ATGCCCTTGG 
TTGCCCCATG 
CTCCTGGGCC 
GGCTCATAGG 
AGATGGAGGG 
ACGGTGGCCT 



ATGTCCGAGG 
TCACCTCCAT 
CATCCTCGGA 
CCTTTTCTGC 
TGACTGAGTC 
ACAAGTCACT 
TGGGCTGTGT 
CGGTTACCTG 
CCAGGACGAC 
AAGGGTGAAC 
ACTGCCCCCT 



GCTCAGTCTA 
AAATGATACG 
GCTTCTCGTG 
CTCTCTAAGA 
AGAGTACCAG 
GTCTCTGGGT 
CTGTCCAAAC 
TTGGGGTGTT 
CTTGCTCCAG 
AAGAAAGAGC 
AGTGGACATT 
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GTCCTCAGCC 

GGTGCTCTGA 

TACTTCCTGG 

GGAAAAATCA 

GGCTCTGATCy 

TCAAGGTCT^ 

CTATTGAGGC 

GCAGTCTTGC 

TCCTTGACGT 

tctcc^gcca 
ggggI 



CACTGCACCT 
GCCACCGCAT 
GQCTAGAACA 
'GGCACCAG 
CAAGCCAGGC 
TGTGTCTTTG 
AGGCTGGGAT 
CAGTACCAAT 
TATCTGCAGG 
GGTTCTCCGG 



CAGCCGGGAA 5040 

CAGAGACGTT 5100 

AGAAGCTGGC 516 0 

TGGAC ACTTA 522 0 

CCTGGACTGG 5280 

AAATAAGGGG 5340 

GAGGGCAGGG 5400 

GGCCCACACA 5460 

GC AGAGATAC 552 0 

AGTACGAAGA 5580 
5615 



(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 63 amino acjLds 
\ (B) TYPE: amino acid 

(C) STRANDEDNESS: singl< 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SIIQ ID NO:48: 



Met Gly Gin Thr Ala Leu Ala 
1 5 

Gin Ala Leu Tyr Pro Asp Leu 
20 

Leu Ser Ala Pro Pro Pro Asp 
35 

Asn Pro Lys Asp Cys Ser Glu 
50 55 



Gly Gly Ser Ser Ser Thr Pro Thr Pro 
10 15 

Ser Cys Pro Glu Gly Leu Glu Glu Leu 
25 30 

Leu Gly Ala Gin Arg Arg His Gly Trp 
40 45 

Asn lie Glu Val Lys Glu Gly Gly Leu 

60 



Tyr Phe Glu Arg Arg Pro Val lAla Gin Ser Thr Asp Gly Ala Arg Gly 
65 70 1 75 80 

Lys Arg Gly Tyr Ser Arg Gly Leu His Ala Trp Glu lie Ser Trp Pro 

85 \ 90 95 
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Leu Glu Gin Arg Gly Thr His Ala Val Val Gly Vgft Ala 
100 105 

Ala Pro Leu Gin Thr Asp His Tyr Ala Ala Ley Leu Gly 
115 120 / 125 

Glu Ser Trp Gly Trp Asp lie Gly Arg Gly ftys Leu Tyr 
130 135 / 140 

Lys Gly Pro Gly Ala Pro Gin Tyr Pro Al& Gly Thr Gin 
145 150 / 155 

Leu Glu Val Pro Glu Arg Leu Leu Val X/al Leu Asp Met 

165 7170 

Thr Leu Gly Tyr Ala lie Gly Gly Tiif Tyr Leu Gly Pro 
180 11 

Gly Leu Lys Gly Arg Thr Leu Tyr p/:o Ala Val Ser Ala 
195 200 / 205 

Gin Cys Gin Val Arg lie Arg Tyr /Leu Gly Glu Arg Arg 
210 215 / 220 

His Ser Leu Leu His Leu Ser Ard Leu Cys Val Arg His 
225 230 / 235 

Asp Thr Arg Leu Gly Gin Val S^r Ala Leu Pro Leu Pro 

245 / 250 

Lys Arg Tyr Leu Leu Tyr Gin 
260 



(2) INFORMATION FOR SEQ ID NO: 49 

(i) SEQUENCE CHARACTERISTIC]! 

(A) LENGTH: 2 8 bas^ p^irs 

(B) TYPE: nucleic aci< 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION :/ SEQ ID NO:49: 
AGCTAGATCT GGACCCTACA ATGGCAGC/ 
(2) INFORMATION FOR SEQ ID NO:f50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base (pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linead 



Thr Ala Leu 
110 

Ser Asn Ser 

His Gin Ser 

Gly Glu Gin 
160 

Glu Glu Gly 
175 

Ala Phe Arg 
190 

Val Trp Gly 
Ala Glu Pro 



Asn Leu Gly 
240 

Pro Ala Met 
255 




/ 



